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Abstract 

This paper investigates a family of dynamical systems arising from an evolution- 
ary re-interpretation of certain optimal control and optimization problems. We focus 
CN particularly on the application in image registration of the theory of metamorphosis. 

P5 Metamorphosis is a means of tracking the optimal changes of shape that are necessary 



, ^ for registration of images with various types of data structures, without requiring that 

the transformations of shape be diffeomorphisms, but penalizing them if they are not. 

^ This is a rich field whose possibilities are just beginning to be developed. In particular, 

metamorphosis and its related variants in the geometric approach to control and opti- 

Q mization can be expected to produce many exciting opportunities for new applications 

and analysis in geometric dynamics. 
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1 Introduction 

With the advent of new devices capable of seeing objects and structures not previously 
imagined, the realm of science and medicine has been extended in a multitude of different 
ways. The impact of this technology has been to generate new challenges associated with 
the problems of formation, acquisition, compression, transmission and analysis of images. 
These challenges cut across the disciplines of mathematics, physics, computational science, 
engineering, biology, medicine, and statistics. For example, in computational anatomy 
(CA) biomedical images are compared quantitatively by calculating the "distance" between 
them, along a path that is optimal in transforming one such image to another. The opti- 
mal path is traversed along a curve of deformations in the group of smooth invertible maps 
with smooth inverses (i.e., the diffeomorphisms) and it is governed by a partial differential 
equation (PDE) called the EPDiff equation. In particular, EPDiff governs the geodesic flow 
on the group of diffeomorphisms, with respect to any prescribed metric. This flow from one 
shape to another also has an evolutionary interpretation that invites ideas from the analysis 
of evolutionary equations. In particular, the momentum map for EPDiff identifled flrst in 
[TU] and explained more completely in [32] yields the canonical Hamiltonian formulation of 
the dynamics of the singular evolutionary solutions of EPDiff. Moreover, in an optimization 
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sense, this momentum map also provides a complete representation of the landmarks and 
contours (outlines) of images to be matched, in terms of the canonical positions and mo- 
menta associated with the evolutionary interpretation [13] . In addition, it provides a natural 
strategy for finding the optimal path between two configurations of either landmarks or con- 
tours [To]. Thus, the momentum map (a concept from Hamiltonian systems) is crucial in the 
construction of an isomorphism between the data structures used in the optimal matching 
of images and the evolutionary singular solutions of the EPDiff equation. This isomorphism 
has already suggested new dynamical paradigms for CA, as well as new strategies for assim- 
ilation of data in other image representations, for example, as gray-scale densities 170] . 
The converse benefit may also develop, in which methods of optimal control and optimiza- 
tion of data assimilation used in image matching for CA may suggest new strategies for 
investigating dynamical systems of evolutionary PDE. In short, the variational formulations. 
Lie symmetries and associated momentum maps encountered in applications of EPDiff have 
led to a convergence in the analysis of both its evolutionary properties and its optimization 
equations. 

This paper focuses on the evolutionary aspects of the PDE that are summoned by adopt- 
ing a dynamical interpretation of the optimal control and optimization methods used in the 
registration of various types of images. The paper does not perform any applications of op- 
timization methods to image registration, nor does it develop any numerical algorithms for 
making such applications. Instead, the paper re-interprets the endeavor of image registration 
from a dynamical systems viewpoint. In particular, as we shall explain, a recent develop- 
ment in the large deformation diffeomorphic matching methods (LDM), in an approach for 
image registration called metamorphosiJ^ [601 EHl SS] introduces a new type of evolutionary 
equation that may be called optimization dynamics. In following this line of reasoning, the 
geometric mechanics approach for evolutionary PDE provides a framework that we hope 
will inform both optimization and dynamics. The primary example in the line of reasoning 
leading to optimization dynamics is the EPDiff equation [^ H2| 170] . 

A brief history of the EPDiff equation 

EPDiff stems from the recognition by Arnold in [Ij that incompressible fluid dynamics could 
be characterized as geodesic flow in the group of volume preserving diffeomorphisms, with 
respect to the kinetic energy metric (L^ norm of the fluid velocity). A few years later, 
the one-dimensional compressible version of EPDiff reappeared as the dispersionless limit 
of the Camassa-Holm (CH) equation [TH]. The CH equation is a completely integrable 
evolution equation for shallow water waves, whose soliton solutions develop sharp peaks in 
the dispersionless limit. Its peaked soliton solutions (peakons) correspond to concentrations 
of momentum into delta-function singularities and are solutions of EPDiff in one dimension 
with the kinetic energy metric. Slightly later, the incompressible version of EPDiff 
with the kinetic energy metric was generalized to higher dimensions in [HI |12] by using 
its symmetry-reduced variational principle, and was interpreted as Euler's fluid equations, 
averaged following Lagrangian particle trajectories. This interpretation soon led to the 

^Although the term "metamorphosis" has a precise mathematical definition that wiU be given below, it 
also satisfies its proper dictionary definition, as "a change of physical form, structure or substance" . This 
paper interprets the change as a type of evolution. 
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introduction of viscosity and some interesting applications of the resulting viscous equations 
as a turbulence model by Chen et al. [191 120] • 

Around the same time, EPDiff arose independently in a completely different context. 
Namely, it arose as the governing equation in the optimization problem for large deformation 
diffeomorphic matching (LDM) in image registration [HU [65l EH]. The recognition that 
EPDiff was arising in these two different contexts provided a fruitful opportunity for dual 
interpretations of the solutions of the same equation. In particular, the "peakons" of the CH 
equation in the water wave context were soon recognized to be the "landmarks" in images in 
the LDM context. Since then, the two types of problems have continued their optimization- 
dynamics interplay and have been found to inform each other, while also showing intriguing 
differences and similarities that arise in their dual formulations as initial value problems on 
one hand and boundary value problems on the other. In particular, the concept of symmetry 
reduction and momentum maps from geometric mechanics that had previously been applied 
so effectively in fluid dynamics pQ and shallow water soliton theory [TO], has recently been 
recognized as a unifying approach for developing multi-mode LDM methods for images whose 
data structure may comprise arbitrary tensors, or tensor densities [15]. This is a rich and 
rapidly developing area of science, for which a complete literature review would be beyond 
our scope here. 

A convergence of these two independent endeavors has led to dual interpretations of 
the same equation and the same key ideas in such different but complementary contexts. 
This convergence is fascinating, and we continue our investigation of it here. In the present 
paper, we emphasize the dynamical interpretations of the equations and approaches that 
are applied in optimal image matching. This is not to say that we solve optimal matching 
problems for images at all in this paper. Rather, being cognizant of the ideas and variational 
formulations underlying the optimal matching approach, we shall apply these formulations 
to study certain classes of equations that arise in the problem of image registration, not 
from the viewpoint of optimization, but rather from the evolutionary viewpoint of geometric 
mechanics [HI [53]. 

The geometric mechanics approach emphasizes Lie group actions on manifolds, momen- 
tum maps, and reduction by symmetry. This approach leads to an understanding of certain 
classes of control and optimization problems as systems of evolutionary equations. In par- 
ticular, the Lie symmetry ideas underlying the process of optimal image assimilation known 
as metamorphosis [601 ESI SS] in combination with the evolutionary geometric mechanics 
viewpoint leads the family of EPDiff equations into the realm of optimization dynamics. Op- 
timization dynamics extends the previous association of image matching ideas with soliton 
theory [13] to produce new results, such as the derivation and re-interpretation of the two- 
component CH system (CH2) as a equation for the dynamics of metamorphosis of gray-scale 
images [16]. The CH2 system is a completely integrable evolutionary system of equations 
that was recently discovered using isospectral methods for solitons [21]. Its inverse scat- 
tering transform is discussed in [27]. Recognizing that some systems of equations arising 
in optimization dynamics for image analysis may be associated with soliton theory raises 
many questions about the mathematical properties of these systems and their solutions, 
particularly when the equations are nonlocal. For example, the initial value problems for 
some of the nonlocal equations obtained in optimization dynamics investigated here allow 
emergent singular solutions, in which the evolution of a smooth, spatially confined, initial 
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condition becomes singular by concentrating itself into delta function distributions. In par- 
ticular, EPDiff has that property and so does the corresponding system of equations for 
the optimization dynamics of metamorphosis. See [HI 1121 El] and [SnUZO], respectively, for 
further discussions of EPDiff from the different but complementary viewpoints of geometric 
mechanics and image matching. 

1.1 LDM approach, EPDiff, and momentum maps 

The LDM approach is based on minimizing the sum of a time-integrated kinetic energy 
metric whose value defines the length of an optimal deformation path, plus a penalty norm 
that ensures an acceptable tolerance in image mismatch. (The matching cannot be exact 
because of the unavoidable errors that arise in real applications.) LDM approaches were 
introduced and systematically developed in Trouve [Ml ES], Dupuis et al. [24j, Joshi and 
Miller |17], Miller et al. [60l |59], Beg g], and Beg et al. [5j. The LDM approaches of 
those papers are based on Grenander's deformable template paradigm for image registration 
[31j. Grenander's paradigm, in turn, is a development of a biometric strategy introduced by 
D'Arcy Thompson |l63j of comparing a template image /q to a target image Ji by finding a 
smooth invertible transformation of coordinates th at maps one image to the other. This 
transformation is assumed to belong to a Lie group G of diffeomorphisms that acts on the 
set of templates containing Jq and Ji. The effect of the transformation on the data structure 
that is encoded in the set of templates is called the action of the Lie group G on the set 
of images. The optimal path in the transformation group is the one that costs the least in 
time-integrated kinetic energy for a given tolerance. This concept of optimization summons 
a control theory approach into the analysis and registration of images. 

In applications of the LDM approach, the optimal transformation path is often sought by 
using a variational optimization method such as the one developed in [211 ISH |65] . Using this 
method, the optimal path for the matching transformation in this problem is obtained from 
a gradient-descent algorithm based on the Euler-Lagrange equation arising from stationary 
balance between kinetic energy and tolerance. This gradient-descent approach does indeed 
determine an optimal matching path. However, from the viewpoint of dynamical systems 
theory, it misses the following potentially interesting question: 



What information and perspective might be obtained by interpreting the 
Euler-Lagrange equations associated to the LDM approach from a dynam- 
ical systems viewpoint? 



The answer to this question may be sought by interpreting the variational optimization 
method in the LDM approach as a form of Hamilton's principle. Hamilton's principle for the 
variational construction of optimal paths with minimal kinetic energy for a given tolerance 
in image mismatch yields an associated set of Euler-Lagrange equations that may then be 
given an evolutionary interpretation. The optimal solutions of these equations have been 
investigated as evolutionary motion on the Lie group of diffeomorphisms in the absence of 
additional penalty terms by Arnold [U [2], Holm et al. [HI |12], Marsden and Ratiu [53] . 
and for the particular application to template matching in Miller et al. [5^. As mentioned 
earher, the optimal paths in these cases are geodesies with respect to the metric provided by 
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the kinetic energy. The kinetic energy for LDM is invariant under right translations on the 
diffeomorphism group. Reducing Hamihon's principle with respect to this symmetry and 
then invoking the Euler-Poincare theory applied to diffeomorphisms produces an evolution 
equation known as the EPDiff equation [HI |12] , whose derivation in the present context is 



explained in Section 8.4 



The solution of the EPDiff equation yields the spatial representation of the geodesic 
velocity, i.e., the tangent vector to the optimal path of deformations along which the minimal 
distance from one image to another is measured. The geodesies themselves may be obtained 
from the solutions of EPDiff for the velocity by a reconstruction process that inverts the 
previous reduction by symmetry after the solution to the EPDiff equation for velocity has 
been obtained. This is analogous to the reconstruction process in classical mechanics that 
recovers the symmetry coordinate conjugate to a conserved momentum as the final step in 
the solution, after the other degrees of freedom have been determined in the reduced space. 

Composing the evolutionary solutions of EPDiff with the reconstruction process provides 
an important representation of diffeomorphisms that relates the endpoint of a geodesic to 
the initial value for momentum in the EPDiff equation. This relation is the momentum 
representation of the deformation. The long-time existence of this representation is based 
on conservation by EPDiff of the kinetic energy norm, which may be chosen so that its 
boundedness affords enough smoothness on the velocities to ensure the long-time existence 
of solutions of EPDiff. In this case, EPDiff admits emergent weak momentum solutions; 
for example, delta-function distributions of momentum that emerge from smooth, spatially 
confined initial conditions fl6\ 139] . This singular behavior is well understood analytically 
only in certain one-dimensional cases. In particular, it is understood for the completely 
integrable case of the Camassa-Holm equation, see, e.g., [211 EI] and references therein. 

The EPDiff equation is of central importance in computational anatomy [TD]. This is 
because the optimal paths sought by LDM on the image template space defined on a manifold 
M are inherited from the geodesies on Diff(M), the Lie group of diffeomorphisms acting on 
the manifold M. These, in turn, are governed by EPDiff. Consequently, any solution of the 
LDM problem for optimal geodesies must involve EPDiff [TO]. Conversely, solving the LDM 
problem directly produces the momentum representation of the optimal diffeomorphism. The 
momentum representation arising from this evolutionary interpretation is then available for 
analyzing anatomical data sets. In any case, despite the disparate forms that the geodesic 
equations may take for the various data structures in the various types of images, all of them 
are instances of EPDiff with the corresponding representation for momentum. The specific 
representation for momentum in terms of the image data structure in a given case is called 
the momentum map. The momentum map for images is another dynamical systems concept 
that emerges as a central feature in this paper. The EPDiff equation and its associated 



momentum map for various image data structures are discussed in Section 8.4 



An interesting example of the momentum map relating solutions of LDM to solutions 
of EPDiff arises for the case of landmark data structure, in which the momentum is sin- 
gularly concentrated at points. The relation between these singular geodesic solutions and 
evolutionary soliton solutions, called peakons for a shallow water wave equation introduced 
in Camassa and Holm [16], has been examined in the context of computational anatomy in 
Holm et al. [13]. A numerical analysis of the stability of these equations is also given in 
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McLachlan and Marsland [56]. See also Micheli [57] for other recent developments involv- 
ing the curvature of the space of landmark shapes. Holm and Marsden [39j explain that 
two independent momentum maps for EPDiff are available in the case that the image data 
structure comprises the manifold Emb(5'^,M^) of embedded closed curves (embedded images 
of S^) in the plane M^. The left action of the group of diffeomorphisms Diff(]R^) of the 
plane deforms the curve by a smooth invertible transformation of the coordinate system in 
which it is embedded, while leaving the parameterization of the curve invariant. The right 
action of the group of diffeomorphisms Diff (S*^) of the circle corresponds to smooth invertible 
reparameterizations of the domain of the coordinates of the curve. In this case, one mo- 
mentum map corresponds to action from the left by the diffeomorphisms on M^, the other to 
their action from the right on the embedded curves. Optimal control and reparameterization 
methods for matching closed curves in the plane using these two momentum maps for the 
space of closed curves in the plane have recently been developed in Cotter and Holm [23] . 

In summary, LDM image analysis is based on optimization methods that are formulated 
as boundary value problems. However, the re-interpretation of their governing equations as 
evolutionary systems by using symmetry reduction of the corresponding Hamilton's princi- 
ple allows various concepts from dynamical systems theory to be profitably applied in the 
solution and interpretation of image analysis problems. Thus, the transfer of concepts and 
ideas between these two fields in the context of image registration has the potential to enrich 
them both. 

1.2 Distributed optimization dynamics, or evolutionary metamor- 
phosis 

As we have been discussing, the paper focuses on the geometric dynamics interpretation of the 
optimization problems designed for image registration. However, rather than concentrating 
on the development of solutions of optimization problems, the treatment here focuses on the 
dynamics that are produced in applying the method of reduction by Lie group symmetry 
to families of optimization problems posed in a geometric setting. This is a new arena 
for geometric dynamics and several new departures are being taken. Among these new 
departures is the investigation of the evolutionary dynamics that arises when distributed or 
nonlocal penalties are imposed in Hamilton's principle, rather than local constraints. For 
lack of a better name, we call this sort of problem distributed optimization dynamics. It is 
the evolutionary counterpart of the metamorphosis approach in imaging science [601 EEl US] , 
which, in turn, is a modification and development of LDM that allows the evolution n{t) of 
the image template to deviate from pure deformation. That is, metamorphosis only penalizes 
the spatial average of the deviation away from the infinitesimal action of the vector fields on 
an image manifold, rather than enforcing it as a local pointwise constraint. This approach, 
in turn, modifies the EPDiff equation and thereby introduces a wealth of new structure and 
new examples that we shall investigate in this paper. 

An explicit comparison for the case that the image templates are gray-scale density 
distributions may help to understand the difference between the LDM approach and the 
metamorphosis approach. 

LDM approach: Given the source and target templates for the images characterized as 
scalar densities Uq and nx at the initial time t = and the final time t = T, respectively. 
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minimize the quantity 



^ 1 

i{u{t))dt+ —\\noor]-' -nrWh, i^X 



over the time dependent vector field u{t), where 7]t is the fiow of u{t) evaluated at time 
t = T, and the formula 

h{t) + div {n{t)u{t)) =0 

is its infinitesimal action on a smooth density n{t) = nQor]^'^ defined over time 0<t<T on 
the domain of fiow. 

Metamorphosis approach: Given Uq and rix, minimize 

iHt)) + ^\\h{t) + diY{n{t)u{t)) dt (1.2) 

over time dependent vector field u{t) and scalar densities n{t). As one sees in Figure [lT] 
for the metamorphosis of shapes characterized as densities, the term "metamorphosis" in- 
troduced in [66j for this process can be understood in practice by its ordinary meaning, as 
"change of shape" , such as the gradual and continuous metamorphosis of a tadpole into a 
frog. 




• o O OOO 



Figure 1.1: These gray-scale images show optimal metamorphoses between two density 
distributions with equal total mass from [56]. The optimization approach would compute the 
distance along the optimal path between between the first and last density in each row. In the 
evolutionary approach, the optimal trajectories for n{t) are computed. The images between 
the endpoints show snapshots along the optimal path n{t) in each row at intermediate points 
in time. In particular, the second row shows that metamorphosis allows a change in topology 
along its optimal path. Our interest focuses on the evolutionary equations for the process of 
metamorphosis. The dynamical system of metamorphosis equations obtained in registering 
such gray-scale image densities is given in Section |8] as one of the examples of the general 
approach. In one dimension, the metamorphosis equations for this class of images comprises 
a completely integrable Hamiltonian system. 
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The paper begins by contrasting optimal control problems with distributed optimization 
problems in a geometric setting. In particular, we discuss the geometric properties of Lie al- 
gebra controls acting on state space manifolds. The latter optimal control approach parallels 
the familiar Clebsch variational formulation of dynamical equations continuum mechanics 
(e.g., [38]). In fact, continuum mechanics was one of the early paradigms for image regis- 
tration [68]. The Clebsch variational formulation of continuum mechanics has recently been 
developed and apphed in the study of the dynamical aspects of optimal control problems 
in a geometric setting (see [2H1 ES])- Conversely, our concern here is to continue this par- 
allel development by studying the implications for dynamics of the geometric approach to 
distributed optimization problems. 



1.3 Plan and main contributions of the paper 

In the remainder of the paper, we compare the dynamical equations that arise from optimal 
control problems with those arising from distributed optimization. This comparison provides 
several examples of how the two approaches differ and, in particular, how their dynamical 
equations differ when their variational problem is regarded as Hamilton's principle for the 
dynamics. Their comparison also identifies the aspects of these approaches that are funda- 
mentally the same. Section [2] begins by explaining the dynamical set up for standard optimal 



control problems treated by the Pontryagin Maximum Principle. Section 2.2| provides sev- 
eral examples illustrating the consequences of applying Lie group controls acting on state 
manifolds by using the Clebsch framework for optimal control. These examples introduce 
the momentum map for the cotangent-lifted action of the Lie group controls on the state 
manifold. The cotangent-lift momentum map is a fundamental concept in the application of 
geometric mechanics methods in the Clebsch framework for optimal control. It turns out that 
the same momentum map is also the organizing principle for the distributed optimization 



dynamics introduced in Section |2.3 After establishing this background for our comparison 



of optimization and dynamical systems methods. Section |2.4| provides an overview of the 
rest of the paper. 

Section [3] begins by reviewing the Clebsch framework for optimal control problems intro- 
duced and studied in [28]. A new class of optimization problems is then introduced which 
is the subject of study of this paper. The stationarity conditions are obtained and the asso- 
ciated equations of motion are determined. Inspired by the extremum problems presented 
earlier. Section |4| presents two Lagrangian reduction procedures for Lagrangian functions 
defined on T{GxQ), where G is a Lie group acting on the manifold Q. These reduction 
methods are used in Section [5] to rederive the equations of motion that were found in Section 
[3| Hamiltonian reduction is carried out in Section [6| As before, there are two reduction 
methods and, in the case of a representation, one of them leads to Lie-Poison equations with 
a symplectic cocycle on the dual of a larger semidirect product Lie algebra. In Section [7] we 
apply these Hamiltonian reduction methods to the optimization problems introduced ear- 
lier. Section [8} by far the longest of the paper, presents a number of examples. We begin by 
studying examples where G is represented on a vector space. The concrete examples treated 
are the heavy top and a class of problems using the adjoint representation. For example, we 
find a modification of the pair of double bracket equations studied in [7j, [8j. Next, we study 
optimization problems associated to affine actions. Actions by group multiplication is the 
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next topic. The concrete examples include the A^- dimensional free rigid body, Euler's equa- 
tions for an ideal incompressible homogeneous and for a barotropic fluid. The iV-dimensional 
Camassa-Holm equation is presented from this optimization point of view, inspired by the 
construction of singular solutions. Finally, the optimization problem is used to obtain the 
equations of metamorphosis dynamics for use in computational anatomy. Section [9] briefly 
summarizes the paper and gives an outlook for future work. 



2 Review of optimal control problems 
2.1 Definitions 

We begin by recalling the definition of optimal control problems. 

Definition 2.1. {Optimal control problems) A standard optimal control problem comprises: 

• a diff'erentiable manifold Q on which state variables n&Q evolve in time t during an 
interval /=[0,T] along a curve n:I^Q from n(0)=no to n{T)=nT, with specified 
values UcUt^Q; 

• a vector space U of control variables ueU whose time dependence u\I^U is at our 
disposal to affect the evolution n{t) of the state variables; 

• a smooth map F :Q xU -^TQ such that F{-,u) : Q ^TQ is a vector field on Q for any 
ugU whose associated evolution equatioi^ 

h = F{n,u) (2.1) 
relates the unknown state and control variables {n{t),u(t)) :I ^QxU; 

• a cost functional depending on the state and control variables 

S:= I i{u{t),n{t))dt, (2.2) 



subject to the prescribed initial and final conditions, at n(0) = nQ and n{T) = nT- The 
integrand i:Q xU ^M., called the Lagrangian, is assumed to be on QxU . 

The goal of the optimal control problem is to find the evolution {n{t),u{t)) of the state and 



control variables such that S is minimal subject to the prescribed dynamics (2.1) and the 
prescribed initial and final conditions n{0) = no, n{T) = nT- 



^The over-dot notation in h means time derivative. Several forms of time derivative appear in applications 
and the meaning should be clear from the usage. Besides the over-dot notation, we shall use the equivalent 
notation d/dt to mean either partial or ordinary time derivative in the abstract formulas, as needed in the 
context. For fluids, we shall also use dt for the Eulerian time derivative at fixed spatial location. Finally, 
the covariant time derivation on a Riemannian manifold will be denoted as D/Dt. 



Gay-Balmaz, Holm and Ratiu 



Geometric optimization dynamics 



11 



The coupling between the control and state variables may be made explicit by using 
the pairing (-, ■)g : T*Q x TQ ^M. and a Lagrange multiplier a G T*Q that imposes the state 
system as a constraint on the cost functional, 



Sr. 



i{u,n) + {a,h — F{n,u))^ 



dt. 



(2.3) 



This is a consequence of the well-known Pontryagin maximum principle [6l HH] . 

The variable a&T*Q is called a costate variable. We now compute the equations asso- 
ciated to the variational principle 5S'c = 0. For simplicity, we suppose here that the state 
manifold Q is a vector space, say W. In this case the cotangent space is T*W = W xW* 
and the costate variable is of the form a = {n,p) & W x W*. The stationary variations of the 



constrained cost function Sc in (2.3) yield 



= 5Sr 



-|- {6p,h — F{n,u)) 



w 



dt+{p,5n)Q 



where {■ , ■)u - U* x U ^M. denotes the duality pairing for the control vector space U. 

Stationarity in the variations 6u gives a relation that determines the controls u in terms 
of the state and costate variables, n and a, respectively, while stationarity in the variations 
{6n,6a) determines the evolution equations for the state and costate variables that minimize 
the cost function S. Since the values of n at the endpoints in time are fixed, 6n vanishes at 
the endpoints. We thus get the stationarity conditions 



6u 



n - 



■■F{n,u), 



P-- 



5^_(5F_ 
5n \ 5n 



p. 



Remark 2.2. Although we shall confine our considerations to the Lagrangian description, 
we point out that the relation to the Pontryagin Maximum Principle in the Hamiltonian 
description is obtained via the Legendre transformation of the integrand in the cost functional 



given by (2.3) which, for each point u in the control space t/, defines the corresponding 
Hamiltonian if„ : r*Q ^ M by 



Hu{an) = {an,F{n,u))r)-i{n,u). 



(2.4) 



The notation a„ for a covector in T*Q means that it belongs to the fiber T*Q of the cotangent 
bundle. For more information about the Hamiltonian approach to geometric optimal control 
theory and the Pontryagin Maximum Principle, see [HlllFI- ♦ 



2.2 Examples: Lie group controls acting on state manifolds 

As an example that illustrates the theory developed in this paper, we consider the case of 
continuum mechanical systems with advected quantities; see Section 6 in [H]. In this case, 
the state manifold M is some vector subspace V* of T(r') ^Den{V), the tensor field densities 
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on a manifold V. We will denote by a&V* these tensor field densities. The group Diff(I') 
of all diffeomorphisms of the manifold V acts on V* by pull back, that is, 

a^ri*a = aori, for all ?7GDiff(P). 

It is thus a right representation of Diff(P) on T(I^) (8>Den(D). We consider here the group 
Diff(I') of diffeomorphism as an infinite dimensional Lie group (either formally or in some 
Frechet sense) whose Lie algebra is given by vector fields vgX(P). The right action of the 
Lie algebra on V* is given by the Lie derivative 



where t\-^exp(t\) denotes the flow of v. 



exp(tv)*a := £^a, 



Example 1 

We present a simple example of optimal control problem based on the geometric formulation 
of continuum mechanics described above. In this example, the control space U is the Lie 
algebra and thus the control variable is a vector field u : = v G X('D). The state manifold 
Q is the vector space V* of tensor field densities. The state variable n:=aEV* is constrained 
to evolve according to the ODE 

d = F(a,v) := £y,a 



and one wants to minimize 



S:=- I \WEdt 



2 

where || ■ ||g is an inner product norm on the Lie algebra q = X{V). Note that we are in the 



setting of Definition 2.1 with M = V* and U = X{V). This is an example of a Clebsch optimal 
control problem, as studied from a geometric point of view in [2H]- For this class of problems, 
the vector field F is given by the infinitesimal generator associated to a group action on the 
state manifold. In the present example, this infinitesimal generator turns out to be the Lie 
derivative. 



According to (2.3), the constrained cost function in this case is 



'S'c = ^ (^\\^r\\l + {p,a- £,,a)v^dt, 

where pEV is the costate variable. This is nothing else than the Clebsch approach to 
continuum mechanics; see, e.g., [38j. The variational principle 6Sc = gives the control 

v = -(poa)*G0, 

where ^ : 0* — )■ is the sharp operator associated to the inner product on g and the bilinear 
operator o:V xV* ^ q* is defined by 



{poa,\) ■.= —{£^a,p), for all pEV, aeV*, vG0. 



(2.5) 
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The other stationarity conditions are 



where £lpGV is defined by 

(^a,£lp) = {£va,p) , for all pGV, aeV*, vGg. 



(2.6) 



(2.7) 



The Clebsch state-costate equations (2.6) are canonically Hamiltonian with 



As is well known, [38] , using the cotangent-lift momentum map given by 11 = —poa to project 
the equations (2.6) on T*M to g*, yields the (left) Lie-Poisson bracket on the dual Lie algebra 
Q* . Explicitly, this Lie-Poisson bracket is given by 



n = ad5;,/OTn = adn»n 



where the Hamiltonian has the expression 



Mn) = -(n,n»>^ 



(2i 



(2.9) 



Example 2 



This example will use the geometric setting of continuum mechanics as described before. 
However, the control vector space will now be given by f/: = g x V^* 3 (v,z/). We choose the 
quadratic Lagrangian 

^(v.-):=i||v|l^ + ^IIHIi^. 

where || ■ ||l2 denotes an norm on V* C T(P) ® Den(T'). As before, the state manifold Q 
is V* and the state variable a&V* is constrained to evolve as 

d = F(a,v,i/) := £^a + i'. 

Note that the advection law a = £^a is not imposed. Instead, the penalty term in the 
Lagrangian introduces the additional term v into the advection law. 
Thus, the constrained action (2.3) becomes in this case 



(2.10) 



whose stationary variation results in 

nT 

= 5Sc 



£lp-p,Sa)^ + {\^+poa,S\) 



+ { —p,Si^) + {Sp,d — £va — i^)v dt+{p,6a)^ 



V 
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where the flat operators b : g — )■ g* and \):V* —>V are associated to the inner products on g 
and V*, respectively. Here the endpoint terms vanish because the values of a at the endpoints 



in time are fixed. According to the variational formula for 5Sc, the cost functional in (2.10) 
is optimized when the controls satisfy 

v = -(poa)«Gs and u = a^p^eV*, (2.11) 

in which the sharp maps are the inverses of the flat maps defined above. For the controls 
(v, u) EqxV*, the state and costate variables (a,p) eV* xV evolve according to the following 
closed system 

These are Hamilton's canonical equations for the Hamiltonian 

Hip,a)=^-{ipoa),ipoaf)^ + ^{p,p^)^. (2.13) 

Remark 2.3. Thus, the evolution of the state a and costate p variables occurs by the corre- 
sponding Lie derivative actions of the vector field (poa)" G = X(P) calculated by applying 
the sharp map jl to raise indices on the cotangent momentum map {a,p) G V"* x V = T*V* ^ 
J(a,p) = —poa G 0* of the cotangent-lifted action. ♦ 

The evolution of the momentum V* xV -^Q* itself is the last formula to be found, just 
as in the Clebsch approach, [38] . 

Proposition 2.4. Denote the momentum map of the cotangent-lifted action by 

n := —poa 

and its dual vector field by 

v:=-(poa)" = H«. 

Then the state and costate equations (2.12) imply the following Euler-Poincare equation f 
the evolution for the momentum map: 



or 



tl = -£lU-a^pop\ (2.14) 

where the operator £\ : 0* — is defined by (i)*H,u) := (H, [v,u] jl) for any u, v G g = X(r'), 
H G 0* = r2^(I?) (8)Den(T') and [v,u]jL = i^vU denotes the standard Lie bracket of vector fields. 



Proof. The proof proceeds by a direct calculation. In the computation below we use 
the standard Jacobi-Lie bracket of vector fields [X,Y]jL{f)=X{Y{f)) — Y{X{f)) for any 



Gay-Balmaz, Holm and Ratiu 



Geometric optimization dynamics 



15 



f eC°°(V). For a fixed Lie algebra element Z Eg = X{V) we compute, 

Il,Z ) = — {poa+poa,Z) 

-- {p,£za) + {p,£zd) 

-- -{£lp, £za) + {p, £z£wo) + cr^ (p, £zP^) 
--{p,£[z,^]a)+a'^ {p,£zP^) 
---{poa,[Z,w])-a^pop\Z) 
---{Il,£^Z)-a\pop\Z) 
---{£lIi,Z)-a^{pop\Z), 



which proves the Proposition 2.4 



Remark 2.5. (Lie algebra formulation of the equations) Recall the the Lie algebra bracket 
[u,v] =aduV on g is minus the Lie bracket of vector fields, that is, 

[u,v] = -[u,v]jl:=-(u- Vv-v- Vu). 

We may thus identify £*^ = — ad* and the previous equations can be rewritten as 

Il = ad;n-(TVop^ 
a = —£y,a + a'^p^ , 

p = £lp. 

These are Lie-Poisson equations with a cocycle for the Hamiltonian 



(2.15) 



/z(n,a,p) = -(n,n«>^ 

with respect to the Lie-Poisson bracket given by, 

adnll aon —poB 
-£na 1 
£lp -1 



a 

Y 



(2.16) 



"n" 




a 




_P _ 





dh/dU = Ui = w 
dh/da = 
dh/dp = a'^p^ 



(2.17) 



in which the variational derivatives of the Hamiltonian are to be substituted into the cor- 
responding places indicated by a box (□). This matrix is identified as the Hamiltonian 
operator for the Lie-Poisson bracket dual to the semidirect product Lie algebra 0(S)(\^* x V) 
plus a symplectic 2-cocycle on (a,p) & V xV* . ♦ 



Remark 2.6. This Hamiltonian matrix will block-diagonalize in the Lagrange-Poincare 
formulation discussed in Section |4j Roughly speaking, this amounts to transforming variables 
H — 7- H := (H+j90a) and (a,z/) — )■ (a,d). ♦ 
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Example 3 



We now consider an example analogous to the preceding one but in finite dimensions. We 
let the orthogonal group G = S0{3) act on by matrix multiplication on the left and we 
choose U: = so{3) (f2,z/) as control space. As usual, we identify the Lie algebra so(3) 



with W. We choose the quadratic Lagrangian i:so{3) x '. 



given by 



(n, v) ■= -m ■ n + -^Ku ■ u, 



for symmetric positive definite matrices I and K. We impose the evolution equation 

X = -rixX + z/ (2.18) 
for the state variable Xg]R^=:(5- As before, the variational principle 5S'c = with 



5. 



1. 



1 



2 2a2 



X + rixX-z/ dt 



yields the controls 



iri = PxX and Kz/ = (t^P, 



as in ( |2ll| . Note that = I-i(P x X) = (P x X)« and K^^P = P«, by the definition of the 
sharp maps. Then the state and costate evolution equations (2.12) take canonical Hamilto- 
nian form with Hamiltonian function 



i/(X,P) 



1 2 

-(PxX)-(PxX)«+yP-P" 



(2.19) 



Intriguingly, the resulting canonical Hamiltonian equations, 

-(PxX)«xX + a2p«, 

= -(PxX)«xP, 

involve the double cross product of the state and costate vectors (X,P) G M'^ x M^. 
double cross products correspond to the Lie derivatives in equations (2.12) which for this 
case become cross products. For more information about the roots of the Hamiltonian 
approach in geometric control theory, see [3j. 




(2.20) 



The 



Upon defining the vector n:=If2 = P x X, equations (2.20) imply 

ri = -rixn-(T2(K-ip) xp, 
x = -rixx+(T2ptt, 

p = -rixp, 



(2.21) 
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which recovers the momentum map system (2.15) for this case. Indeed, one may compute 
directly that 

ri = Pxx+Pxx 

= {-n X P) X X+P X {-n X X+a^P*) 

= (Pxri) xx+(rixx) xP+fT^pxp" 

= -(XxP) xfi + a^Px (K-^P) 

= nxri+(T2px(K"^p), 

from which the resuh follows. 



Remark 2.7. (Lie algebra formulation) The Lie algebra bracket on se(3) ~so(3)©]R^ may 



be written on ]R3 X ]R3 as, 



ad(n,ct)(f2,Q;) 



Its dual operation is 



flxfl,flxa — flxcx. 



ad^„,„)(n,P) 



rixn-cKxP-rixP 



)■ 



In terms of the ad* operation on se(3)*, the motion equations for (n,P) in (2.21) can be 
rewritten as 



(n,p) = (^-rixn-(T2pflxP,-rixP 

= (^ad^n + (T2poP«,-rixP) 



: ad 



(n.p). 



The result of the last calculation may be rewritten in Lie-Poisson bracket form as 

(ri,P)=ad*. ^(^,P), 

^ ^ [dh/dn,dh/dF) ^ ' ^ ' 



(2.22) 



with Hamiltonian (2.19) rewritten in these variables as 

/i(n,p) = -n-n» + — p-p^ 

^ ' ' 2 2 



(2.23) 



and using the (left) Lie-Poisson bracket defined on the dual Lie algebra sc(3)*. This is the 
Hamiltonian and Lie-Poisson bracket for the motion of an ellipsoidal underwater vehicle in 
the body representation. See, e.g., [33] for more discussion and references to the literature 
about the geometrical approach to the dynamics and control of underwater vehicles. ♦ 



We have seen that equations (2.20) for the state-costate vectors (X,P) are canonically 
Hamiltonian and that the system (2.22) for (n,P) is Lie-Poisson on the dual of a semidirect 



Gay-Balmaz, Holm and Ratiu 



Geometric optimization dynamics 



18 



product Lie algebra. Now, it remains to include the dynamics of the coordinate X into a 
single structure for the entire system (2.21) for (n,X,P). We observe that equations (2.21) 
may be put into Lie-Poisson form, as 



n " 




"nx xx Px" 




'dh/dn 




X 




Xx 1 




dh/dlL 




p 




Px -1 




dh/dP 





nx Xx Px 

Xx 1 
Px -1 














a2p« 



(2.24) 



This is the Lie-Poisson bracket dual to the semidirect product Lie algebra 5o(3)(S)(l 
plus a symplectic 2-cocycle on (X,P) gM^ x M^. 



Remark 2.8. As mentioned earlier, the Lagrange-Poincare and Hamilton-Poincare formu- 
lations in Sections |4] and |6] will block-diagonalize this Hamiltonian matrix. ♦ 



Remark 2.9. (Comparison of the examples) The major difference between Example 1 and 
Examples 2 and 3 is the following. In Example 1, we impose the advection equation a = £^a 
as a constraint on the minimization problem. This is done, as usual, by introducing a new 
variable p and adding the term {p,a — £va) in the action functional. In Examples 2 and 3, 
the advection law is not imposed exactly, but only up to an error term 

z/ := d — £^a, 

whose norm is added to the Lagrangian as a penalty, and needs to be minimized. Of course, 
in this case, the relation z/ = d— £va is a constraint as seen in the term (p,d — £y,a — 'u). As 



we have seen in Proposition 2^ this error term implies a modification of the equations of 
motion. 

One of the aims of the present paper is to transform the control problem corresponding to 
the cost function in (2.10 ) into an optimization problem in which the penalty term ||d — i^vC^P 



appears. This objective motivates the introduction of the distributed optimization problem 
in the next section. ♦ 

2.3 Distributed optimization problems 



Definition 2.10. {istributed optimization problems) A distributed optimization problem im- 



poses the evolutionary state system in (2.1) as a penalty involving a chosen norm, rather 
than as a constraint. The resulting cost functional is thus taken to be of the form 



{u,n) + ^-^\\h- F{n,u) 



dt. 



(2.25) 



where the norm is associated to a Riemannian metric onQ. In this cost functional, the state 
system dynamics (2.1) is imposed only in a distributed sense; namely, as a penalty enforced 
by the norm on Q, not pointwise on Q, as in (2.3). We assume that cr^ >0. 
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We may initially regard this second approach as simply modifying the cost function in 



the optimal control problem (2.3) by introducing a penalty based on a norm of the state 



system. We will show later that the solutions of the two types of optimization problems 
coincide in the limit cr^ — )■ 0. 

In the case where Q is a vector space, denoted by W, and the norm is associated to an 



inner product, the variations of the distributed cost function Sd in (2.25) now yield 

T 



SSr, 







6i f6F\ . ^ . 




T 



dt+ {p,Sn) 



w 



(2.26) 



where the momentum variable p obtained from the variation with respect to the vector field 
h&W is defined by 

a'^p:=(^h-F{n,u)^ eW\ (2.27) 

and in this case the b map (index lowering) is applied with respect to the inner product on 
W. 

Let us return to Example 2 above and treat it as distributed optimization problem. 



Example 



As in §2.2[ we consider the geometric setting of continuum mechanics. Contrary to Example 
1 above, we do not impose the advection equation Qj — £f-^(X clS 3i constraint but clS Qb penalty. 
The problem is now to minimize the expression 







1„ „2 1 „ ^ ,,2 

2l|v||g + ^l|a-i^va||i2 



dt, 



where || ■ ||l2 is a norm on the space of tensor field densities. This problem is clearly 



equivalent to that of Example 2 in §2.2[ The variational principle 5Sd = yields the control 

^ = — {poa)^ 



and the same equations as before 
where we have defined the variable p by 

1 



(2.28) 



p:-- 



{a-£^ayeV. 



(2.29) 



It is important to observe that in this approach the variable p is not really needed, since 
it is defined in terms of the other variables. This is not the case for the Clebsch approach 



described in the Examples of {2.2 for which p is an independent variable. For the Clebsch 



approach, the relation (2.29 ) is recovered as a consequence of the variational principle 5Sc = 0. 
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Control problems versus optimization problems 

We now make some simple comments concerning the role of the variational principles in 
control problems and optimization problems. 

Let i = i{u,n) : ?7 X Q — J-M be a cost function and F a vector field as in the general Defi- 
nition |2.1[ As we have seen, one associates to these objects the following problems. 



(1) The optimal control problem consists of minimizing the integral 

S ■= £{u,n)dt subject to the conditions h = F{n,u) 
Jo 

and the usual endpoint conditions. The resolution of this problem uses the Pontrya- 
gin maximum principle which, under sufficient smoothness condition, implies that a 
solution of this problem is necessarily a solution of the variational principle 



SSc = sj^ (i{^,n) + {a,h- F{n,u))^dt = 0. 



Example 1 in §2.2[ for which the cost function is a kinetic energy and the vector field 
F is given by a Lie derivative, illustrates this method. 

(2) The optimization problem with penalty described above consists of minimizing the inte- 
gral 

Sd:=j^ (^i{u,n) + ^\\n-F{n,u)f^dt 

subject to the usual endpoint conditions. Of course, the solutions of this problem are 
necessarily solutions of the variational principle 



The examples in here illustrate this point. 



Remark 2.11. Despite the analogy between the two variational principles SSc = and SS^ = 
0, the origins of these principles are quite different. 

In the first problem, the functional S is minimized under a constraint, leading to the 
construction of the functional Sc by introducing the costate variable a. The well-known 
Pontryagin approach tells us that the solutions of the optimal control problem are necessarily 
critical points of S^. 

The variational principle of the second problem is simply the stationarity condition im- 
plied by optimization of the functional Sd, without other constraints, except the endpoint 
conditions. ♦ 
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2.4 Overview 

In |2H] a general formulation for a large class of optimal control problems was given. These 
problems, called Clebsch optimal control problems, are associated to the action of a Lie group 
G on a manifold Q and to a cost function £ : g x Q — )■ M, where q denotes the Lie algebra of 
G. The Clebsch optimal control problem is, by definition, 

min [ i{^{t),n{t))dt, (2.30) 
Jo 

subject to the following conditions: 

(A) Either =e(t)Q(n(t)) , or (A)' = ; 

(B) Both n(0)=no and n(T) = nT, 

where C,q denotes the infinitesimal generator of the G-action, that is, 

d 



dt 



t=o 



These optimal control problems comprise abstract formulations of many systems such as 
the symmetric representation of the rigid body and Euler fluid equations [El |36] , the dou- 
ble bracket equations on symmetric spaces [7], the singular solutions of the Camassa-Holm 
equation [16] , control problems on Stiefel manifolds [12] , and others [U [11] . 

Optimal control problems on Lie groups have a long history; see [H], [1H| and references 
therein. Some of the earliest papers dealing with such problems are [13] and 



Goals of the paper 

The first goal of the present paper is to replace the constraints in the Clebsch optimal 
control problem with a penalty function added to the cost function and to obtain in this 
way a classical (unconstrained) optimization problem. The fundamental idea is to use the 
constraints to form a quadratic penalty function in order to get the Lagrangian 




(2.31) 



We first determine necessary and sufficient conditions characterizing the critical points of this 
Lagrangian. Taking the time derivative of one of the conditions and using the others leads 
directly to certain equations of motion. We then show that these equations are naturally 
obtained by Lagrangian reduction and that they are the Lagrange-Poincare equations of a 
Lagrangian function in the material representation that is the sum of the original Lagrangian 
plus the square of the norm on the velocity vector. This approach links directly to the 
approach used in |46j in the study of the metamorphosis of shapes. From a variational point 
of view, one replaces the Hamilton-Pontryagin variational principle in the Clebsch framework 

r 

6 {i{u,n) + {a,h^UQ{n)))dt = , 
Jo 
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by the principle 



S [ (i{u,n) 



Jo V 



+ 



1 

2^ 



{n)r]dt = 0, 



in the framework of distributed optimization. 

This paper traces how the dynamical equations change on moving from constraints (op- 
timal control), to optimization via imposition of a cost, and then on to metamorphosis. 
Passing from optimal control to optimization preserves the momentum map, but this pas- 
sage modifies the reconstruction relation. The evolution is no longer only for the momentum 
map of the reduced Lagrangian. Instead, the momentum canonically conjugate to the veloc- 
ity on the configuration manifold becomes coupled to the momentum map equations (which 
are the Euler-Poincare equations), with coupling constant cr^. 

Another feature of the paper, directly related to the dynamics of our optimization prob- 
lem, is the description of the equations of motion by Lagrangian and Hamiltonian reduction. 
In particular, we carry out a certain type of Lagrangian reduction adapted to the problem, 
that we naturally call metamorphosis reduction, since it was directly inspired by the example 
of the metamorphosis approach to image dynamics pS]. This Lagrangian reduction leads 
to the expression of the associated variational principles and Hamiltonian structures. In 
metamorphosis, the optimization problem involves Riemannian structures induced by Lie 
group actions on themselves and on Lie subgroups by group homomorphisms. This is a rich 
field whose possibilities are still being developed. In particular, metamorphosis and related 
variants of the geometric approach to control and optimization can be expected to produce 
opportunities for new applications and analysis in geometric dynamics. 

3 Distributed optimization 

In this section we begin with a quickly review the Clebsch optimal control problem studied 
in [2E]- Then we introduce the class of optimization problems investigated in this paper, 
obtained by adding to the cost function a penalty given by the norm of the constraints in 
the previous approach. 

3.1 Review of Clebsch optimal control 

Clebsch optimal control formulation and main results 

We recall from |28] some facts concerning Clebsch optimal control problems. Let $ : G x Q — J- 
Q be a left (resp. right) action of a Lie group G on the manifold Q and let i: qx Q he 
a cost function. The Clebsch optimal control problem for the curves ^(t) and n{t) is 





subject to the following conditions: 



(A) Either h{t) =^{t)Q{n{t)) , or (A)' h{t) = -^{t)Q{n{t)) ; 
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(B) Both n(0)=no and n(T) = nT, 
where denotes the infinitesimal generator of the G-action associated to ,^ G 0, that is, 

d 



ds 



cxp(s^) 



s=0 



If condition (A) is assumed, then by applying the Pontryagin maximum principle, we 
obtain that an extremal curve n{t) gQ is necessarily the projection of a curve a{t) & T*Q 
that is a solution of the equations [28] 



5i di 
— = 3{a), d = ^r*Q(a) + Vero — 



(3.2) 



Here 3 :T*Q q* denotes the momentum map associated to the cotangent-lifted action of 
G on T*Q. Recall that J is given by 



(J(ag),0 = («g>'^Q(?))- 

{^{aq),0 = {(^q,^Q{q)) = -{(^qOq,0, wheu ^Q{q) = £^q 
The expression || Gg* denotes the usual functional derivative of i{-,n) for each fixed nEQ 
whereas ^ := d£(^, ■) G T*(5 denotes the differential of the function £(^, ■) :Q— J-IR for each 
fixed ^G0. For a,f3 eT*Q, the map 'VeTa/3 denotes the vertical lift of f3ET*Q relative to 
aeT*Q, defined by 



Verbis :-- 



d 
ds 



{a + sf3)eTo,{T*Q). 



s=0 



In (3.2), ^T*Q denotes the infinitesimal generator of the cotangent-lifted action of G on T*Q. 
Note that the vector field ^r*Q(tt) + Vera, ^ on T*Q is the Hamiltonian vector field associated 
to the Hamiltonian 

an G T*Q ^ (anM^)) - ^(^,^) e M, 

in which the Lie algebra element ^ G is regarded as a parameter. Using these equations, we 
determine that the optimal control ^ is the solution of the equations 



d 5i 
dtSi 



de 

On 



(3.3) 



If condition (A)' is assumed, then (3.2) is replaced by 



-3(a) 



a - 



-^T*Q(tt)+Ver„ 



and the optimal control ^ is the solution of the equations 



d 6i _ 6i 



dn J 



resp. 



d 6i 
dtS^ 



dn 



Si 



(3.4) 



■ad^--3{ 



(3.5) 



We refer to |28j for proofs of these statements and further discussion. 
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Variational principle 



We shall prove that equations (3.2) or (3.4), together with the constraint h = ±^Q{n) follow 
from the variational principle 



5 {t{i,n) + {a,hTiQ{n)))dt = Q, 



(3.6) 



for curves tH'^(t)G0 and t\-^ a{t) eT*^^-^Q . The variations 6^ are free, whereas the vari- 
ations 6a are such that the induced variations 6n vanish at the endpoints, that is, 6n{0) = 
6n{T) = 0. 

To see this, let G and G T*_^Q be curves whose infinitesimal variations at s = are 
S^eg and 6a G T*Q. We have 



S I {i{^,n) + {a,nT^Q{n)))dt: 

'0 



^ /Si \ /di 

Vai-'"^* 



+ 



d 
ds 



s=OJO 



{as,ns)dtT -j- 
ds 



T 



{3{as),Qdt. 



(3.7) 



A direct computation in canonical coordinates, using 6n{0) = 6n{T) = in an integration by 
parts, shows that 

-f- / {as,ns)dt= / ^lcan{a,Sa)dt, (3.8) 
'^"5 3=0 Jo Jo 

where ^Ican denotes the canonical symplectic form on T*Q. In addition, using the definition 

of the momentum map 3 ■.T*Q^q* we have 



d 
ds 



{3{a3),Q = {T^3iSa),0 + {3{a)M)=^c.n{^T'Qia),Sa) + {3icy)M)- (3.9) 



s=0 



Using relations (3.8) and (3.9) in formula (3.7) yields (3.2) and (3.4). 



Alternative form of the stationarity conditions 

Note that the equations 



d = ±^T*Q(a)+Ver, 



di 
dn 



(3.10) 



imply the constraint n = ±^Q{n). To see this, it suffices to apply the tangent map Tit to 
(3.10), where n ■.T*Q Q is the projection, and recall that ^t*q and are vr-related. By 
introducing a Riemannian metric g on Q, it is possible to rewrite the stationarity condition 
in a more explicit way, as we show in the following lemma. 

Lemma 3.1. Suppose that Q is endowed with a Riemannian metric g and denote by V and 
D/Dt the associated Levi-Civita covariant derivatives. 



Then the equation d = ±(^T.Q(a;) + Vera ^ in (3.10) is equivalent to the system 



D 



di 



(3.11) 
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Proof. We begin by recalling the definition and main property of the connector K : TTQ — )■ 
TQ associated to a Riemannian manifold {Q,g). A general detailed treatment for connectors 
associated to linear connections can be found in [SB], Section 13.8. In infinite dimensions 
we need to assume that the given weak Riemannian metric has a smooth geodesic spray 
S eX{TQ). In natural local charts of TTQ, the intrinsic map K is defined by 

Kioc{x,e,u,v) = {x,v + T{x){e,u)), (3.12) 

where r(a;) is the Christoffel map defined by the quadratic form in the fourth component 
of the geodesic spray S{x,u) = {x,u,u,—T{x){u,u)). In finite dimensions, the Christoffel 
map has the familiar expression T{x){e,uy = T''ji^{x)e'^u^ , where F*;. are usual the Christoffel 
symbols associated to the metric g. The relation between the connector and the Levi-Civita 
covariant derivative is given for all X,Y &X{Q) by 

VyX = KoTXoY. (3.13) 

The connector K induces an intrinsic map, also denoted by K:TT*Q^T*Q defined in 
natural local charts by 

ir;„e(x,/3,u,7) = (a:,7-/3(r(a;)(M,-))). (3.14) 
The associated covariant derivative 

Vxa: = KoTaoX (3.15) 

on T*Q recovers the Levi-Civita connection on one-forms aEQ^{Q). Although the same 
notation is used for the connector on TQ and on T*Q, it will be clear from the context 
which one is meant. 

The proof of Lemma 3.1 begins by recalling the vector bundle isomorphism TT*Q^ 
T*Q®TQ®T*Q given by 

X^{aT*Q{X),TTi{X),K{X)), 

where ctt*q : TT*Q — )■ T*Q is the projection. Therefore, to prove the equivalence it suffices to 
apply the maps Ttt and K to the equation a = ±^T*(9(a) + Vera As we have seen before, 
applying Ttt yields the first equation in the system (3.11). The definition (3.14) of K and 
(3.15) immediately imply the equalities 

D f di\ di 

K{a) = -a and ^^Ver„-J=-. 

Thus, to finish the proof, it suffices to compute K {C^t*q{c())- Given Vn ETnQ, an eT*Q, and 
^ e 0, we have 

Taking the s-derivative at s = yields 

{K{^T'Q{ar.)),Vn) + {a:a,Ki^TQ{Vn)))=0. (3.16) 

Noting the equalities K{C,TQ{vn)) = K{TS,Q{vn)) = Vt,„^Q(n), we obtain the formula 

K{iT'Q{an)) = - V^Q(n)) , 
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which proves Lemma 3.1, that the stationarity conditions (3.10) and (3.11) are equivalent 



for a Riemannian manifold. 



System (3.11) may also be obtained directly from the variational principle 6Sc = 0, Sc 



Jo (^(^)''^) + {c(,n^ ^Q{n)))dt, by using a Riemannian metric on Q. However, we have chosen 
to derive the stationarity conditions (3.2) or (3.4) together with the constraint n = ±^Q(n) 



for the functional Sc without introducing a Riemannian metric; see (3.7)-(3.9) above. 



Lagrangian and Hamiltonian approach 



Equations (3.3) and (3.5) can be obtained via Euler-Poincare reduction for the G-invariant 
function C : TG x Q — )■ M induced by i. More precisely, upon fixing qEQ and defining the La- 
grangian Cq{ug) ■.= C{ug,q) on TG, one finds that the equations (3.3) and (3.5) are equivalent 



to the Euler-Lagrange equations for Cg by invoking a generalization of the Euler-Poincare re- 
duction theorem. We refer to ^29j for a proof of this assertion and for applications to systems 
with broken symmetry. If Q is a representation space of G, one recovers the Euler-Poincare 
reduction theorem for semidirect products; see [ill W2\ . 

If the Legendre transform >||Gg* is a diffeomorphism, we can form the associated 

Hamiltonian h:g* x Q — M defined by 



h{fi,n):={(x,^)-i{^,n), where 



5e 



In this case, the Lagrangian C is hyperregular on TG, the variable q&Q being considered 
as a parameter, and we can form the Hamiltonian H:T*GxQ— >]R. More precisely, fixing 
qEQ, we define 

where Eg is the energy associated to the Lagrangian £g:TG— t-M and FCg-.TQ -^T*Q is 
the classical Legendre transform of Cg. The function H :T*G' x Q— t-M is then defined by 



l-i{ag,q) ■. = l-iq{ag). Equations (3.3) and (3.5) can be written in Hamiltonian form as 



/i = =Fadih/i — J — 



and 



n- 



n - 



5h 
6fi 



(3.17) 



in), 



Q 



■ ± adih /U + J 

6fi 



dh\ 
dn ) 



(3.18) 



respectively. They are obtained by Poisson reduction of Hamilton's equations for % on 
T*G X Q, where Q is endowed with the zero Poisson structure. 



In terms of h, the equations (3.2) or (3.4) read 



/i = J(a) 



a - 



5h 
6fi 



(a) — Ver, 



dh 

dn' 



(3.19) 



Gay-Balmaz, Holm and Ratiu 



Geometric optimization dynamics 



27 



and 

, = -J(a). d.= -(|)^Ja.)-Ver„|. (3.20) 



As in Lemma 3J^, by introducing a Riemannian metric g on Q, these equations can be 
rewritten as 

3.2 Optimization using penalties 

As before, we consider a left (resp. right) action $ : G x Q — t- Q and a cost function £ : g x Q — )■ 
M. We suppose that the manifold Q is endowed with a Riemannian metric g. The basic idea 
is to treat the condition (A) or (A)' as a penalty rather than a constraint. Therefore, in the 
case of condition (A) above, we consider the minimization problem 

rnin^ (^i(^,n) + ^\\h-^Q{n)f^ dt, (3.22) 

and if condition (A)' holds, we consider 

rnin^ (^i{^,n) + ^\\h + ^Qin)\\'^ dt. (3.23) 

These two problems are subject to the condition 

n(0)=no and n(T)=nT, 

for given no,nT & Here the norm is taken with respect to the Riemannian metric g on Q 
and cr 7^ 0. 

Stationarity conditions 

In order to find the critical curves, we consider the variational principle 

S £ (^m,n) + ^\\hT^Qin)fyt = (3.24) 

for the two curves (^,n) : [0,T] ^gxQ, where n has fixed endpoints. That is, the variation 
6^ is free and the variation 6n vanishes at the endpoints. 

We will treat condition (A) and (A)' simultaneously. In all the expressions below, the 
upper sign refers to condition (A) and the lower sign refers to condition (A)'. The ^-variation 
yields the condition 

6i 1 

— = ±— J(z/j;), where z/„ := nip ^^(n), (3.25) 
()4 
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and 1^1^: = g{n){i'n,-) eT*Q . We now compute the variations of n, where we denote by V 
and D/Dt the covariant derivatives associated to the Levi-Civita connection of the metric 
g. For 5n= 41 ^n^, we have 



V\^"' 



,5n) + 



a 



D 



2 \ ""n, ^5 



s=0 



D 

Ds 



s=0 



dt 



dn^ / a 



Upon exchanging the order of derivatives, -§i-^ = (which is allowed because the Levi- 
Civita connection has no torsion) one finds the equation 



(3.26) 



Consequently, {^,n) is a solution of (3.24) if and only if (3.25) and (3.26) hold. In what 
follows, equations (3.25) and (3.26) will be called the stationarity conditions. 

Note that here, in contrast to the argument in ^3.1 , specific use of the Riemannian metric 
is made in computing the stationarity equations from the condition dSd = 0, where 



1 



^i^,n) + —\\nT^Q{n) 



dt. 



(3.27) 



This is natural, because a Riemannian metric is provided by the penalty term in the problem 
statement. Using the notation 



TT := 



^ {nT^Q{n)feT*Q, 



enables the stationarity conditions (3.25) and (3.26) to be written as 
61 



-.=T(.,ve,) + -, 



(3.28) 



These equations should be compared with the other stationarity conditions (3.2) and (3.11), 

di 



^ = ±J(a), h = ±^Q{n), -^a = ^(a,V^Q) , , 



(3.29) 



associated to the Clebsch optimal control problem. These two sets of stationarity condi- 
tions are analogous. However, the corresponding variables a and vr have different origins. 
Namely, the costate variable a was introduced as the Lagrange multiplier in formulating the 
constrained Clebsch variational principle (3.6), whereas the variable tt arises as a canonical 
momentum, dual to the penalty variable in the unconstrained variational principle (3.24). 

Recall from Lemma 3.1 that the last two stationarity conditions of the system (3.29) are 
equivalent to 



d = ±^T*Q(a)+Vera 



di 

dn 



An analogous result concerning the stationarity conditions of the distributed optimal control 
problem is given by the following lemma. Let tl := : T*Q ^ TQ. 
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Lemma 3.2. The system of two equations 



(3.30) 



is equivalent to the single equation 



di 



71 = ± ^T-Q (tt) + Ver^ ^ + a^Sin) , 

where S G X{T*Q) is the Hamiltonian vector field associated to the kinetic energy of the 
Riemannian metric. 

Proof. It suffices to observe that the vector field S verifies the properties 

K{S{a)) = Q and TT:{S{a)) = a\ 



for all aET*Q. Then the proof is similar to that of Lemma 3.1 



Remark 3.3. In terms of the Hamiltonian h associated to i, the stationarity conditions 



(3.25) and (3.26) read 



n = ±{ (n) + aV 



D / „f6h\ , , 



dh 
dn 



or, equivalent ly, 



vr = ±(f') (7r)-Ver.f^ + a25(7r) 



T*Q 



These equations should be compared to their analogues in (3.19) - (3.21). ♦ 



Equations of motion associated to the stationarity conditions 



We now compute the differential equation associated to condition (3.25), that is, the analogue 



of equations (3.3), (3.5). The formulation will involve the following 0*-valued (1,1) tensor 
field. 

Definition 3.4. Consider a Lie group G acting on a Riemannian manifold {Q,g). We define 
the Q* -valued (1,1) tensor field :T*Q x TQ^g* associated to the Levi-Civita connection 
V by 

{j^^{ag,Uq),r]) := (a^, V„^r/Q(g)) , (3.31) 
for all Uq e TqQ, E T*Q, and G 0. 

The main properties of the tensor field J-"^ are given in the following lemmas. 
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Lemma 3.5. For all G T*Q, Uq G TqQ, and ^ G 0, 

where K denotes the connectors of the covariant derivatives on TQ and T*Q, respectively 
{see formulas (3.12)-(3.15)). 



Proof. It suffices to use formula (3.16) in the proof of Lemma 3.1 ■ 

The following important property of J-"^ is valid when G acts by isometries. 

Lemma 3.6. If G acts by isometries, then J-"^ is antisymmetric, that is 
for all Uq G TgQ, aq G T*Q. 

Proof. Since G acts by isometries, £^gg = which implies (V^q)^ = — V^q. ■ 
We also need the following preparatory lemma, valid for any action. 

Lemma 3.7. Let 3:T*Q—^q, (J(«g),0 = momentum map of the 

cotangent-lifted action of G on T*Q and let g be a Riemannian metric on Q. Then for 
a curve a(t) & T*f^^^Q we have 



J(a(t))=J(^a(t))+^^(a(t),g(t))), 



where D/Dt and V denote the Levi-Civita covariant derivatives associated to g. 
Proof. For all f]EQ, we have 

^(J(a(<)).')> = ^(«W,wfeW)) = (^aW.')o('(W)V('"(*).-F^')efeW) 

\Dt 

Using the definition of J-"^ implies the required formula. 



Dt 



J (§-Atyj ,7]\ + (^a{t),Vqit)VQ{q{t))). 



Note that this proof of Lemma 3.7 did not assume that the metric is G- invariant and 
that the formula is valid for left and right actions. 

Lemma 3.7, and equations (3.25), (3.26) enable one to compute the motion equations 
associated to the minimization problems (3.22), (3.23) as follows: 

d S£ 



dtS^ 



, 1 b- 



1 



±,( 



\dn) C72 ^ ' ^ ' V2 



±^^^(z/^,z/„)±(T)ad; 



51 
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where in (=f) one chooses — (resp. +) when G acts on Q by a left (resp. right) action; so in 
the last term there are four choices of sign. Thus, when the penalty is given by H^i- — ^q('^)|P 



(condition (A)), the critical curves of the variational principle (3.24) are solutions of 



d 6i 
D . 



■ =F ad! 



a-" 



\dn J 

When the penalty ||?^ + ^q(^)|P (condition (A)') is chosen instead, one finds, 

= + ar{* J 

dt 6^ 

D . 



(3.32) 



dn 



(3.33) 



Remark 3.8. The motion equations (3.32) and (3.33) should be compared to the analo 



gous motion equation (3.3) and (3.5), respectively, obtained by the Clebsch optimal control 



approach. Note that the term J-'^(i/^,z/„) is an additional force term that is due to the 
presence of the quantity z/„. The variable = ra± C,Q{n) measures the inexact matching and 
evolves according to the second equation -^z/j^ = ±(7(i/„, V^g) + cr^|^. We shall return to the 
discussion of inexact matching for images in Section 8.5[ ♦ 



Thanks to Lemma 3.6 
tries. 



we 



obtain the following important result, when G acts by isome- 



Theorem 3.9. Let G be a Lie group acting on a manifold Q and let i: qx Q be a cost 
function. We consider the two associated Clebsch optimal control and distributed optimiza- 
tion problems. Suppose that the Riemannian metric used in the penalty term is G-invariant. 
Then the two problems yield the same equations of motion. 



(3.3), (3.5). 



Proof. It suffices to use Lemma 3.6, and to compare equations (3.33), (3.32) with equations 



For completeness we rewrite below the equations (3.32) and (3.33) in the particular case 
where G acts by isometrics. Using J^^(z/^,i/„) =0 and V^q = — V^g, for this case yields 



and 




(3.34) 



■n- 



df 

Dt " ' ""'"^ ' " 

Remark 3.10. The remainder of the present paper will investigate these equations as dy- 
namical systems, rather than as optimal control problems. See 
approach is taken. ♦ 



(3.35) 



in which a similar 
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4 Lagrange- Poincare and metamorphosis reduction 

In this section, we present two Lagrangian reduction approaches that will be useful in under- 



(3.23), (3.22). 



standing the geometry of the equations (3.33), (3.32) associated to the minimization problem 



Let G act on the left (resp. right) on Q. Let L : T(G x Q) — J-M be a left (resp. right)- 
invariant Lagrangian under the action of G given by 



Uq)^ {hUg,hUq) resp. {Ug,Uq)^ {Ugh,Uqh). 



Two reduction processes are discussed. The first uses Lagrangian reduction (see |TH]) and 
the second is a formulation of the reduction used for metamorphosis in 



Theorem 4.1. {Lagrange- Poincare reduction) Let g EG and qEQ be two curves and define 
the curves n:= g'^q eQ and ^ : = g~^ g E q {resp. n:=qg~^ eQ and ^■.= gg~^ Eq). 

Then {g,q) is a solution of the Euler-Lagrange equations for L if and only if (n,^) is a 
solution of the Lagrange-Poincare equations 

dt ^^'^ ' dt -^'^ ' 

Dt On On ^ dt ^ K Dt dh On ^ dt ' 

where the Lagrange-Poincare Lagrangian iLP = ^Lp{n,n,C,) '-TQ x q^M. is induced from L by 
the quotient map 

T{G xQ)^TQxg, {uq,Uq) ^ (n,n,0 := {un-^Q{n),0 (4-2) 

forn: = g~^q, Vn- = g~^Uq, ^■.= g-^Ug {resp. n:=qg-^, Un: = Uqg-^, ^■. = Ugg-^). 
These equations are equivalent to the variational principle 

S I iLp{n,h,Odt = 0, 







for arbitrary variations 6n and constrained variations SC,=ri-\-[C,,'r]] {resp. S^ = fj—[C,,ri]). 

In the Lagrange-Poincare equations, D/Dt and diip/dn denote the covariant derivative 
and the partial derivative associated to a fixed torsion free connection V on Q. 

Proof. We treat the case of a left action and apply the results of [IH]. The projection associ- 
ated to the G-action reads tt:GxQ^Q, 7!'{q,g) = g~^q. Thus, by taking the tangent map, we 
find T7i{ug,Uq) = {g~^Ug — {g~^Ug)Q{g^^q)). The adjoint bundle Ad(G x Q) can be identified 
with the trivial vector bundle Qxq via the identification [(^f,^),^] ~ ((jf^^g, Adg-i(^). Using 
the principal connection A{ug,Uq) : = Ugg~^ , the diffeomorphism {T{G x Q))/G = TQ x q is 
given by [ug,Uq]\-^ {g^^Uq — {g^^Ug)Q{g^^q),g^^Ug). Thus, the Lagrange-Poincare reduction 



map has the required expression (4.2). Since the chosen principal connection is flat, we 



obtain the Lagrange-Poincare equations (4.1). 
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For the same G-invariant Lagrangian L :T(G x Q) — J-M as before, we define another re- 
duced Lagrangian £M = iM{^n,0 '■'^Q ^ associated to the quotient map 

T{GxQ)^TQxQ, {ug,Uq)'^{un,C):={g'^Uq,g'^Ug), resp. {Pn.C) ■={uqg~^ ,Ugg~^). 

This reduced Lagrangian differs from the Lagrange-Poincare Lagrangian £lp defined above, 
but one can pass from the one to the other by the vector bundle isomorphism 

TQxQ^TQxQ, (z/„,0 ^ {vn-iQ{n),0. (4.3) 

that is, we have 

£lp (n, n, = (n, n + (?^) , , 

for both the left and right cases. The reduction associated to this quotient map will be called 
metamorphosis reduction, since it is the abstract framework underlying the metamorphosis 
dynamics described in |16] . 



Theorem 4.2. {Metamorphosis reduction) Let g EG andqEQ be two curves and define the 
curves h'n: = g~^q ETQ and ^ : = g~^ g E q {resp. Un:=qg^^ETQ and ^ := gg^^ E q) . 

Then {g,q) is a solution of the Euler-Lagrange equations associated to L if and only if 
(i/,^) is solution of the equations 



dt 6^ 
D d^M 

, Dt dUn \ dUn 



di 



M 



di 



M 



dn 

diM 
dn 



di 



M 



d 

dt 



(4.4) 



where + {resp. — ) occurs when G acts on Q by a left {resp. right) action, and J-"^ is the 
Q*-valued (1,1) tensor field defined in (3.31). In (4.4), diu/dn and diujdvn denote the 
horizontal and fiber derivatives, respectively. 

These equations are equivalent to the variational principle 



S 



-M 



'M)dt = 0, 



with variations 5^ = r]+[^,r]] {resp. S^ = ri—[^,ri]) and = jy^uj ^-W ui^q — W uT^q. 
The proof will use the following lemma. 



Lemma 4.3. Consider the two reduced Lagrangians iip and iu- Then we have the relations 




Proof. Using the relation iLp{n,h,^) =iM{n,n + ^Q{n),^), we easily obtain the first and 
third expression. For the second we recall that partial derivatives ^^j^, are defined with 
the help of a connection V on Q. Let c{t) ETm(t)Q be a smooth horizontal curve covering 
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a curve m(t)EQ and such that c(0)=n, m(0) =m„ G T^Q. By using the decomposition of 
TTQ into its vertical and horizontal part, we have 



LP 



On 



d 
dt 



i=0 

dTQiMin,h,0 
diM 



d 

dt 



^M{c{t)+iQ{m{t)),i) 



i=0 



d 

dt 



c{t) + 



dn 



de 



t=o 



d 
dt 



t=o 



d 
dt 



c{t) + 



d 
dt 



^Qim{t)) 



M 



dn 



dt 

de 



c{t) + 



t=0 



t=0 

d_ 

dt 



t=o 



M 



dVn 



where K -.TTQ -^TQ denotes the connector map associated to V. Here dj-q is the exterior 
derivative on TQ and the fourth equality is a general formula valid for linear connections 
that links the total derivative to the horizontal and vertical derivatives. ■ 



Proof of Theorem |4.2[ We treat simultaneously the case of a left and right action. Using 

obtain the equ 

n = i'n-iQiri). 



the second equation in (4.1) and Lemma 4.3, we directly obtain the equations 

d 



D diM di 



M 



Dt dfn dn 



di 



M 



dUn 



dt 



By Lemma |3.7[ for any rjEQ, we have 

D diM 



d diM 



dt V dun 



Dt du, 

diM 
dn 



) + { ^^^nVqin] 



dv„ 



di 



M 



dn 



diM 
dv„ 



di 



M 



dVr, 



di 



M 



dUr, 



di 



M 



dVr. 



where we use the equalities V^^^^g — ^^qVq = [Vq^^q] = -f[v^^]Q- We thus obtain 



d diM 



dt \ du, 



n 



di 



M 



dn 



diM 1 , T 

- — ,z/„ I ± ad^ J 
du. 



di 



M 



dUr, 



Inserting the formula = + J (^^7^ j in the first equation of (4. 1 ) and using the previous 
expression for ^^f-j, we get the required equation 



d SiM . ,* SiM 

■ ± adc —rz J 



dt 5^ 



di 



M 



dn 



di 



M 



dUn 



,Ur. 
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Left (right) reduction and right (left) action 

In some applications, we need to consider left-invariant (resp. right-invariant) Lagrangians 
whereas G acts on Q by a right (resp. left) action. We quickly present here the situation, by 
giving the main formulas in this case. Let G act on the left (resp. right) on Q. We consider 
here the case of a right (resp. left) invariant Lagrangian L : T{G x Q) — j-M under the action 
of G given by 

The Lagrange-Poincare Lagrangian Ilp :TQ x q^M. is now induced by the quotient map 

T(G xQ)^R, {ug,Ug) ^ (n,n,0 := (//„ + eQ(n),0, (4-6) 

for n: = gq, 1^^:= gUg, ^ :=Ugg^^ (resp. n:=qg, h'n- = Ugg, C,:=g^^Ug). The Lagrange-Poincare 
equations are now given by 



d 



dt 

D dhp 
Dt dh 



LP J* SIlp 



d 



'LP 



de 



resp. 



LP 



dn 



-0, 



dt 



-n = n. 



dt 6^ 

D dhp 
Dt dh 



■ ad; 



6i 



LP 



dn 



0, 



di' 



(4.7) 



-n = n. 



Note the change in the sign when compared to (4.1). Note that we now have the relation 
iLp{'>T',h,^) =iM{h — ^Q{n),C,). Therefore, the conclusions of Lemma 4.3 should be replaced 
by 



5^ 5^ 



-J 



di 



M 



dVn 



di 



LP 



di 



M 



dn dn 



diM 



M 



dVn 



di 



LP 



di 



M 



dh dvn 



Thus, equations (4.4) are replaced by 



r d 6i 



M 



dt 5^ 

D diM 

Dt dv„ 



-r- , T 

■ =F adf -^rzr + J 



diM 

dVr,. 



di 



M 



dn 



di 



M 



dVn 



di 



M 



d 



dn ' dt 



where — (resp. +) occurs when G act on Q by a left (resp. right) action. 



Alternative form of the equations 



For completeness, we give here an alternative form for the second and third equations in 



systems (|4.4|), (4.8). This alternative form is analogous to that given in Lemmas 3.1 

d diM 



3.2 



and reads 



dt (9z/„ 



T'Q 



M 



di_ 
dv„ 



+ Verc 



di 



M 

dn 



Horc 



(4.9) 



where, for a„GT*(5, Yioi '■ TnQ ^Ta„T*Q denotes the horizontal lift associated to the 
Levi-Civita connection on T*Q. Note that we have the formula Hor^b Vn = S{vn)-, where as 
before, iSg3£(T*Q) is the Hamiltonian vector field associated to the kinetic energy of the 
Riemannian metric. 
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5 Optimization, the Lagrangian approach 

In this section, we show how to obtain the motion equations associated to the distributed 
optimization problem by using Lagrangian reduction. More precisely, we will use the meta- 
morphosis reduction, starting from the unreduced Lagrangian associated to i, augmented by 
the square of the norm of the velocity vector. 

Let G act on the left (resp. right) on Q and consider a cost function i: = i{^,n) on qxQ. 
Let C : TG x Q — M be the associated G-invariant Lagrangian on TG x Q. The definition of 
C depends on the condition ((A) or (A)') we want to impose. 

• If (A) holds, we suppose that £ is invariant under the right (resp. left) action 

{Ug,q)^{ugh,h'^q), resp. {Ug,q)^ {hug.qh"^), (5.1) 
i.e., we define C{ug,q): = £{ugg-^,gq), resp. C{ug,q) := i{g-'^Ug,qg). 

• If (A)' holds, we suppose that £ is invariant under the left (resp. right) action 

{ug,q)^{hug,hq), resp. {ug,q) ^ {ugh,qh), (5.2) 
i.e., we define C{ug,q) : = i{g'^Ug,g'^q), resp. C{ug,q) := i{ugg'^ ,qg'^). 



Definition of the unreduced Lagrangian 

The G-invariant Lagrangian C : TG x Q — > M produces the function i by reduction. We now 
want to modify C in order to obtain, by reduction, the integrand 

i{Ln) + ^Jn±^Q{n)r (5.3) 

of the distributed optimization problem. This will be done by constructing, from £, a G- 
invariant Lagrangian L defined on the tangent bundle T{GxQ). Of course, the definition 
of L depends on the condition ((A) or (A)') we want to impose. 

• If (A) holds, we define L : T(G x Q) by 

L{ug,Ug): = C{ug,q) + ^\\gUqf, resp. L{ug,Ug) := C{ug,q) + ^\\ugg\\^ . (5.4) 



• If (A)' holds, we define L : T(G x Q) ^ M by 

L{ug,Ug) : = C{ug,q) + ^Wg'^Uqf , resp. L{ug,Ug) := C{ug,q) + ^\\ugg'^f . 

(5.5) 

Of course, the norm appearing in the second term of the Lagrangian is the same as the norm 
used in the integrand (5.3) of the distributed optimization problem. It is associated to a 
Riemannian metric on the manifold Q. The presence of the group action in the second term 
is needed in order to make the Lagrangian G-invariant. 
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In the particular case where the Riemannian metric is G-invariant, the Lagrangian L is 
simply given by 

L{Ug,Ug):=C{Ug,q) + —\\Ugf 

and the associated Euler-Lagrange equations for L read 



D 



d dC dC 



0, 



dt dg dg 

where D/Dt denotes the covariant derivative associated to the Riemannian metric on Q. 



Lagrangian reduction 



Using the quotient maps (4.6) and (4.2) associated to Lagrange-Poincare reduction, we can 



compute the reduced Lagrangian associated to L. When the G-invariance (5.1) (condition 
(A)) holds, we get 

i^p{n,n,0 = i{^,n) + ^\\h-^Q{n)f, 



and when the G-invariance (5.2) (condition (A)') holds, we get 



iLp{n,h,0=i{^,n) + ^\\n + ^Qin)f. 

We have thus obtained the integrand of the distributed optimization problem by Lagrange- 
Poincare reduction. However, in order to compute the associated equations of motion, it 
will be more appropriate to use metamorphosis reduction. For this approach, the reduced 
Lagrangian is readily seen to be 

in all cases. 

We now compute the reduced equations of motions. Since the functional derivatives of 
are 

6iM^Sl ^^l^b diM ^ di 

5^ 5^' dun dn dn 

the reduced equations (4.8) (associated to condition (A)) and ( 4.4[ ) (associated to condition 
(A)') become, respectively 



d 6i 
dtJl 



(5,6) 



and 



d se 



51 fdi\ 1 V/ b ^ 



di 



(5.7) 
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These are exactly the equation (3.32) and (3.33) that are verified by the extremals of the 



distributed optimization problem, obtained here by metamorphosis reduction. 



Remark 5.1. The fact that metamorphosis reduction recovers the motion equations verified 
by the extremals of the distributed optimization problem is natural in the following sense. 
The extremals are given by the unconstrained variational principle 



= SSh = S 



1 

2^ 



this gives the stationarity conditions (3.25), (3.26). These imply (but are not equivalent 



to) the metamorphosis equations (3.32), (3.33) obtained form the same action Sd under 
constrained variations. ♦ 



6 Hamilton-Poincare and metamorphosis reduction 

In this section, we present the Hamiltonian side of the two Lagrangian reduction approaches 
described in Section HI 

As before, we let G act on the left (resp. right) on Q. We consider a left (resp. right)- 
invariant Hamiltonian if :T*(G x Q) — j-M under the action of G given by 

{ag,ag) {hag,hag) resp. {ag,ag)\-^ {agh,agh). 

As before, there are two reduction processes. The first uses Hamilton-Poincare reduction (see 
[T7] ) and the second is the Hamiltonian version of the metamorphosis reduction described 
in Section m 

Theorem 6.1. {Hamilton-Poincare reduction) Let ag G T*G and ag G T*Q be two curves and 
define the curves Hn:= g^^ag^T*Q and jj,:= g~^ag + i{g~^ag) Eq* {resp. T:n'- = Oiqg^^ eT*Q 
and fi := agg~^ + 3 {agg^^) Gg*). 

Then {ag,ag) is a solution of the canonical Hamilton equations for H on T*GxT*Q if 
and only if{iTn,fJ') is a solution of the Hamilton-Poincare equations 

^ - A* ( ^ _ A* 

~rf-l — Hdshfip /J' , I "77 A'' — — tXCishffp^, 

dt Sfi I dt Sfj, 

resp. < (6.1) 
d dhnp d dhnp d dhup d dhnp 

dt diT ' dt dn ' ^ dt dir ' dt dn ' 

where the Hamilton-Poincare Hamiltonian hHP = hHp{'n'n,f^) '■T*Q x g* is induced from 
H by the quotient map 

T*{GxQ)^T*Qxq\ (ag,ag)^(7r„,/i):=(7r„,/c + J(7r„)) (6.2) 

for n: = g-'^q, Tin-=g'^aq, i^: = g~^ag, {resp. n:=qg-'^, TXn-=aqg~'', K:=agg~^). 

In the Hamilton-Poincare equations (6.1), the second equation is written in Darboux 
coordinates. One can write it intrinsically as 

— TTn = XhHp{TTn), 
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where Xh^p is the Hamiltonian vector field ofhup viewed as a function on T*Q, the variable 
fi&Q* being considered as a parameter. 

Proof. We treat the case of a left action and apply the results in [17]. The coadjoint bun- 
dle Ad* {G X Q) can be identified with the trivial vector bundle Q x Q* via the identification 
[{g, q),fi]:^{g^^q, Ad* fi). Using the principal connection A{Ug,Ug) :=Ugg~^ , the diffeomor- 
phism {T*{GxQ))/G = T*QxQ* is given by [ag,ag]\-^ {g~^ag,g~^ag + J{g~^ag)). Indeed, 
the horizontal-lift associated to A reads iloT(^ggy.TnQ^TgGxTgQ, B.oT(^g^q)Vn = {Og,gVn), 
its dual map is [Hor(g g)]*(ag,ag) = (yf~^aq, and the momentum map J: T*(G x Q) — J-g* is 
S{ag,ag) = agg^^ + 3{ag). Thus, the Hamilton-Poincare reduction map has the required ex- 



equations (6.1) 



pression (6.2). Since the chosen principal connection is fiat, we obtain the Hamilton-Poincare 



It is convenient to write the equations of motion (6.1) in matrix form, namely 



d 


/i 




± adn /i 


dt 






^Lni^n 



5hHP 

J \AT*QhHPA 



(6.3) 



where d^.g is the exterior derivative on T*Q. 

For the same G-invariant Hamiltonian H ■.T*{GxQ)^M. as before, we define another 
reduced Hamiltonian Km = hMi'n'n,!^) : T*(5 x g* — j-M associated to the quotient map 



T*{GxQ)^T*QxQ*, 
{ag,ag)\^{7!-n,n):={g'^ag,g'^ag), resp. 



{agg ^,agg ^] 



This reduced Hamiltonian differs from the Hamilton-Poincare Hamiltonian Hhp defined 
above, but one can pass from the one to the other by the vector bundle isomorphism 

T*Q X Q*^T*Q X Q*, (7r„,/i) ^ (7r„,/i- J(7r„)), 



that is, we have 

hHp{Hn,K + 3{'Kn)) = hM{7!-n,K), 

for both the left and right cases. Of course, the previous isomorphism is the dual map to 



(4.3). 



As on the Lagrangian side, we fix a Riemannian metric g on Q. This allows us to write 
the reduced Hamilton equation a bit more explicitly. Note, however, that it is possible to 



write the reduced Hamilton equations without the help of a metric; see (6.5) below. 



Theorem 6.2. {Metamorphosis reduction) Let agET*G and ag(zT*Q be two curves and 
define the curves TTn- = g~^aq&T*Q and n:= g'^agE Q* {resp. Tin'-= oigg^^ eT*Q and k: = 
agg'^eQ*). 
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Then {ag,aq) is a solution of the canonical Hamilton equations for H on T*GxT*Q if 
and only if {71^1 k) is a solution of the equations 



( d 


Sk 




dt 

< 


d 




^ dt " 


~ \ 6k 



dh 



M 



dn 



OTTn J 



(6.4) 



where X^j^^ is the Hamiltonian vector field associated to Km viewed as a function of TCn- 

Proof. We treat simultaneously the case of left and right actions and apply Poisson 
reduction. The reduced Poisson structure on T*Q x g* associated to the quotient map 
{ag,ag) ^ {g-^ag,g-^aq), resp. {ag,ag) h-^ {agg~^ ,agg~'^) is given for any f,ge C°^{T*Q x q*) 
by 



SI 5i 



where ^ : T* {T* Q) q* is the cotangent bundle momentum map and the last term is the 
canonical Poisson bracket on T*Q; see Proposition 10.3.1 in [H2]. Consequently, the reduced 
Hamilton's equation are 




K = ± ad*6hja + J'idhMi'^'n)) 

Sk 



I'M 

6k, 



(6.5) 



Now it suffices to decompose the derivative dhM into the vertical (fiber) and horizontal 



partial derivatives and use Lemma 3.5 to write 

'dhM 



J{dhMM)=J 



dn 



(6.6) 



This proves the result. 



Equations (6.4) can be conveniently written in matrix form 

Shu 
5k 

d-T'QhuA 



d 


K 




± adp K J' 


dt 


7r„ 







(6.7) 



where in the (1,2) entry one uses formula (6.6). We shall see in Section 8.1 that if Q is a 
representation space of G, this formula gives rise to a Lie-Poisson equation on a semidirect 
product with a cocycle. 
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Left (right) reduction and right (left) action 

We quickly present here the equations arising when the Hamiltonian H:T*{GxQ)^'R is 
invariant under the action of G given by 



{ag,ag) {agh,h ^ag) resp. 



[a 



9' 



The Hamihon-Poincare Hamihonian hup: T*Q x g* — j-M is now induced by the quotient map 
T*(Gxg)^M, («g,«g)^(7r„,/i):=(7r„,fi:-J(7r„)), (6.8) 



for n: = gq, 7in-=g<yq, K:=agg ^ (resp. n:=qg, 7in-=aqg, i^- = g ^Ug). The resulting 
Hamilton-Poincare equations are given by 



at Sfj, 
d dhup d 
dt dix ' dt 



dn ' 



resp. 



d _ 

—fj. — aa shjjp /i, 

(It 5fi 

d dhffp 
dt dn ' 



d 
dt 



dh 



(6.9) 



HP 



dn 



Note the change in the sign when compared to (6.1). Note that we now have the relation 
/iHp(7r„,K- J(7r„)) = /iAf(vr„,K). 



Likewise, equations (6.4) are replaced by 



d ^ dhM\ , f 



d 

dt''''- 



6h 



M 



(6.10) 



T'Q 



where — (resp. +) occurs when G acts on Q by a left (resp. right) action. As in (6.3) and 



(6.7), equations (6.9) and (6.10) may be re-expressed in matrix form as 

5hHP 



d 






=F adp /i 


dt 


7r„ 




^Lni^n 



(6.11) 



d 


K 




=Fadn/t —J' 


dt 









Sh 



M 



5k 



(6.12) 



Remark 6.3. (Link with the untangling map) Recall that the vector bundle isomorphism 

(e, ^^n) e X TiV ^ (e, z/„ ± (n) ) = (e, n, ri) G X TiV 

allows one to pass from the metamorphosis reduced equation to the Lagrange- Poincare equa- 
tions. Its dual map 



(/i,7rO €0* X T*N^{^± J(7r„),7r„) = (/t,7r„) G0* x T*N, 
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naturally passes from the Hamilton-Poincare description to the metamorphosis approach. 
The inverse of this map is known as the untangling map in applications (^33^) since it trans- 
forms the Hamiltonian structure of the metamorphosis equation into the direct sum of the 



Lie-Poisson bracket on g* and the canonical Poisson bracket on T*N; see (6.1)-(6.7) and 



(6.11 )-(6.12). Recent theoretical developments and new applications of the untangling map 



appear in [29]. ♦ 

Legendre transformation and alternative formulation 

When the Hamiltonian H comes from a Lagrangian L by Legendre transformation, then we 
have the following relations between the reduced objects: 



6i 



M 



di 



M 



6^ ' "" dun 

The partial derivatives with respect to n are related by the formula 

dhM OIm 



dn 



dn 



In this case, the reduced equations on the Lagrangian and Hamiltonian side ((4.4), (4.8) and 



(6.4), (6.10)) are readily seen to be equivalent. To see this, it suffices to use the formula 

Xhian) 



dh dh 

Hor„„- Ver„„ — 

oan on 



for the Hamiltonian vector field, together with the alternative formulation (4.9) for the 



reduced equations on the Lagrangian side. This also shows that the second equation of the 

dhu 



systems (|6.4|) and (|6.10|) can be equivalently written as 



D 

Dt' 



-TTr, 




:^ TT, 



dn 



d .( ShM 
—n=± — — 
at \ OK 



in + 



Q 



7 Optimization, the Hamiltonian approach 



Suppose we are given a left (resp. right) action of G on Q and a cost function i = i{^,q) 
on gxQ. Let the map ^ 1| be a diffeomorphism and consider the associated Hamiltonian 
h:g*xQ^R defined by 

6i 

h{fi,q):={fi,0-K^,q), T7 = /^- 

As in Section |5] for i, the function h induces a G-invariant function "H : T*G x Q — j-M. Of 
course, "H can be obtained from £ by a Legendre transformation, the variable q being con- 
sidered as a parameter. Recall that given a Riemannian metric g on Q, we associated to £ a 
G-invariant Lagrangian on T(G x Q) by adding to £ a G-invariant expression involving the 



norm of the vector in TQ; see (5.4), (5.5). For example, in the case of condition (A)' and if 



G acts on the left we have defined 



L{Ug,Uq) 



■^{ug,q) + ^\\g ^Ugf. 
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Taking the Legendre transformation of this hyperregular Lagrangian yields the G-invariant 
Hamiltonian H on T*{G x Q) given by 



cr 



-1^ l|2 



H{ag,aq)=H{ag,q) + — \\g a, 



The reduced Hamiltonian associated to metamorphosis reduction reads 



a 



huiiimi^) = h{n,n) + — \\T[n\ 



When condition (A)' is assumed, the reduced Hamilton-Poincare Hamiltonian reads 



a 



In the case of condition (A), we have 



/i/fp (7r„ , /i) = /iM (7r„ , + J (7r„,) ) = /i (/i + J (7r„) , + y 1 1 7r„ I p . 



Using the relations 



dhM 2 tt 1 dh 
— cr vri; and — 



dn dn ' 



the reduced equations (6.4) and (6.10) become, respectively 



— K = ±adih K + J 7^ I , 
at sn \on 



d 



and 



dt 

( d 

it 
d 
dt 



6h 
6k 



, . , . dh 
.vr„)-Ver^„ — + (T S[ 



TXr. 



, K = =Fad^K-J( ^ ) 
dt \on J 



where S G X{T*Q) is the Hamiltonian vector field associated to the kinetic energy 



1 



1 



7r„ir = -^(4,4)- 



(7.1) 



(7.2) 



2" 2^ 

These equations recover the motion equations associated to the distributed optimization, in 



Hamiltonian form, cf. Remark 3.3 



8 Examples 



In this section we apply the general theory to various group actions. 
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8.1 Action by representation and advected quantities 

Let G be a Lie group acting by left (resp. right) representation on the dual vector space 
Q = V*. Given a Lie algebra element ^, we denote by ^v*i(^)=^ci (resp. ^y. (a) = a^) the 
associated infinitesimal generator. Using the diamond operator o:V xV* ^g* defined for p G 
V and aEV* hj (poa,^) := —{^a,p), (resp. (poa,^) := —{a^,p)), for any ^Eq, the cotangent 
bundle momentum map is J{a,p) = —poa. 



Metamorphosis reduction and Lie-Poisson formulation with cocy- 
cles 

Assume that \^ is a left representation space of G and that reduction has been performed on 
the left. The other cases have similar formulations. In view of the identities above, equations 
(4.4) become 

d 51m 51m , 51m , 51m 
-r = adf — — + — ^ — oa+— — ov 
dt 5^ « 5^ 5a 5iy 

d 5iM Jiu , 5iM d 

dt 5v oz/ 5a dt 

where £m = ^^//(C, a, z/) : g x V"* x \/* — )• M is the reduced Lagrangian. Performing the Legendre 

transformation hM{'i,Ci,7T) := (k,^) + {n,^) — iM{(,,Ci,^) where 

5iM 5iM 



5i ' 5v ' 

one finds the corresponding Hamilton equations for Km = hM{i^i0.i'^) '-Q* ^V* xV ^M. as 

d ,^ 5hM , 5hM 
—n = a.(lshj,j K 1 — oa + vro— — , 

dt TTT 5a OTT /g 

d 5hM 5hM d 5hM 5hM 
dt 5k 5a ' dt 5k 5tt 



These equations recover (6.4) for the case of a left G-representation. 

Note that the inverse Legendre transformation (assuming it is a diffeomorphism) is given 
by 5hM/5K = ^, 5hM/57i = 1^ and that 5hM 1 5a = —5iM 1 5a. 

The proof of the following theorem is a direct verification. 



Theorem 8.1. The equations of motion (8.1) are Lie-Poisson on (0(S)(K x V"*))* with 
the cocycle C : (V^ x V^*) x (V" x V^*) — j-M given by the canonical symplectic structure Qcan on 
T*V = Vx V* , where the Q-left representation onVxV* is given by (^,f)^-^■^^', {^,1^) ^ , 
for ^ Eq, veV, V eV* . Thus these equations can be written in matrix form as 



dt 



-□a 
-□vr 



-□Oa TTOD 

1 
-1 



SHm/^i^ 
5hM/5a 
SHm/^-k 



12) 
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The Clebsch optimal control approach 

Given a cost function £ : g x V* — )• M, the Clebsch optimal control problem with condition 
(A)' yields (for left representation) the stationarity conditions 

6i di 

— = -J(a,p)=poa, a = -^a, p=-^p+ — . (8.3) 

For a representation on the right, one replaces ^a, by a^, p^. These equations imply the 
Euler-Poincare equations of motion 

d 6i 6e di 

When condition (A) is assumed, we get the stationarity conditions 

— = J{a,p) = -poa, a = ia, p = ^p+— (8.4) 
and the motion equations 

d 5i „ 5i di 

dtrr^^'^^Trd-a''''- 

These are the Euler-Poincare equations for semidirect products, useful for the study of phys- 
ical systems with advected quantities; see H2]. 

Note that when the Lagrangian i is given by the kinetic energy associated to an inner 
product on g, the control is given by ^ = ±(poa)^, where tl : 0* — ^0 is associated to the inner 
product on q. We get the equations 

a + (poa)''a = 0, ^+(^00)^^ = 0. 



This is the abstract formulation of the double bracket equations] see ^8.1.2 below. 



The distributed optimization approach 

In order to state the optimization problem with penalty, we endow V* with a inner product. 
The corresponding functional is thus 

i{i,a) + ^\\a±ia\\\ 

The Levi-Civita covariant derivative V is the ordinary derivative; therefore we have 
V6^v*(a) = ^6 (resp. Vb^v/*(a) = 6,^), for all a.h&V*. We thus obtain the expression 
^^((a,f ), (a,6)) = — i;o6. If condition (A)' is assumed, the motion equations (3.33) read 



d 6i 
dil^ 
d ^ 
dt 



Si Si 1 
^ 04 oa a- 



-V ov 



resp. 



d 5i 
JtJl 



■ad 



5i 6i 
+ — oa + 



V ov 



-a 



6a 



-^v , z/ = a + ^a. 



d i, r,6i 

dt oa 



^ 6^ 5a 



1.5) 
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where \):V*^V is the flat isomorphism associated to the inner product on V*. When 



condition (A) is assumed, we have (see (3.32)) 

d 6i 6i 6e 1 

— adc — — —oa- 



dt6^ 

d L 



5^ 6a 



-V Of 



dt 



-V 



- o 



,6e 

6a 



resp. 



-.^u , u = d-^a, 




^adf — -^oa oi/ 

^ 04 6a a'^ 



b 2^^ 

— u —a — 
dt 6a 



(8.6) 



As we have seen in the general theory, these motion equations arise by metamorphosis 
reduction associated to the Lagrangian £(Mg,a) + ^||d|p. They can be obtained by the 
stationarity conditions (3.25), (3.26). In our case, for a left representation they read 



6J_ 



M 
da' 



As usual, to compare these conditions with the stationarity conditions (8.3), (8.4) given by 
the Clebsch optimal control approach, we define 



1 



and we get 



6_i_ 



■ ±poa, 



d = =F^a + crV 



P = T^P + 



di 
da 



•7) 



For a right representation one simply replaces ^a, C,p by a^, p^. 

When the Lagrangian i is given by the kinetic energy associated to the inner product on 
Q, the control is given by ^ = ±(^00)", and we get the equations 

d + {poa)^a = a'^p\ p+(poa)^p = 0. 

This is the abstract formulation of the double bracket equations, modified by the extra term 



cr p"; see i.8.1.2 below. Note that in the formula above, there are two different sharp opera- 



tors, tl:0*— ^0 and '^I'.V-^V*, associated to the inner products on q and V*, respectively. 
Note that, consistently with Theorem 3.9, if the inner product is G-invariant, then u^ou- 



0. This has already been noticed in the Remark 2^ of the introduction, for the case of an 
isotropic inner product. 

8.1.1 Heavy top 

Consider the evolution equations for a state system in the frame of a rotating body 

X = Xxri (8.8) 

for vector state and control variables X,r2 gM^ related to the rotation matrix O G 5*0(3) by 
X = 0^^z and fix =O^^Ogso(3). These vectors are, respectively, the vertical spatial axis 
as seen from the rotating body and the body angular velocity vector. 
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We choose to optimize a cost functional consisting of the difference between the rotational 
kinetic energy and the gravitational potential energy, subject to a penalty imposed by the 



state system (8.8). This cost function is 



Sd 







2a2 



X + rixX 

-iri ■ ri - mgv ■ x+ ^ 

2 2ct2 



dt 



X + f2xX|f dt, 



(8.9) 
(8.10) 



where m is the total mass of the body, g is the value of the gravitational acceleration, I is 
the real positive definite symmetric matrix of moments of inertia in the body, x is the center 
of mass vector in the body, and a is a real constant. The variation with respect to X defines 
the Legendre transform relation (costate variable) 

cr2p^: = X + rixX. 



The variation of the cost functional is given by 

' 'm+xxp]-sn-{p+nxp+mgx)-s:s. 



SS-- 



dt + 



P-6X 



1 T 



8.11 



(8.121 



The general system (8.7) takes in this case the following double cross form, involving the 
double cross product of vectors (X,P) gM'^ x M^, cf. equations (2.20), 



XxP)»xX = (T2p 



with 




(8.13) 



PxX) 



These equations correspond to the three equations in the general system (8.7), with the 
upper sign chosen. After denoting the angular momentum vector 11 G by 



n:=iri = Pxx, 



substitution of equations (8.14) into (8.13) yields 

U = UxU^-mgxy<X + a^PxP^ and ± + U^xX = a^P^ 
which can be written in matrix form as 



(8.14) 
(8.15) 



"n" 




'nx Xx Px" 




'dhM/dU 




"nx Xx Px" 




" n» " 


X 




Xx 1 




dhM/dX 




Xx 1 




mgx 


p 




Px -1 




dhM/dP 




Px -1 




a^ptt 



where 



/iM(n,x,p) = -n-n«+m^x-x+— p-p 



(8.16) 



(8.17) 



which suggests that one might regard the system (8.16) physically as a model of the motion 
of an ellipsoidal underwater vehicle, influenced by an external gravitational torque. These 
are equations (8.2) in this particular case. 
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Remark 8.2. The analogous extremal problem for compressible fluids is given by 



U,p 



^ ^ 1 



min / ( e{u,p) + ^\\p + diY{pu)f ]dt, 



where u is the Eulerian velocity and p is the fluid density in spatial representation. The 
advection law p + div(p'u) = (exact matching) is no longer imposed. Instead its expression 
||p + div(p'u)||^2 (inexact matching) is used as a penalty. Since this can be treated in a more 



general case, we defer this discussion to ^8.3.3 ♦ 



8.1.2 Adjoint representations 

We let G act on on the right on its Lie algebra q by the adjoint representation. The 
infinitesimal generator is thus = [x,^], the diamond operator is o: g* x q* ,pox = 
— ad*p and the momentum map is 3{x,p) = ad*p. 



The Clebsch optimal control approach 



The Clebsch optimal control (with condition (A), that is, x=[x,C,]) associated to a cost 
function i = i{^,x) yields the (generalized) Euler-Poincare equations 

d 6e 6£ di 
~rv7 = — + ad^ — . 
dt 6^ ^6^ dx 

We suppose that g is endowed with a bi-invariant inner product 7. This allows us to identify 
the dual Lie algebra with itself and to write ad*p= — In this case, the motion equations 
are 



d^di 



+ 



dx 



,x 



These equations are obtained from the stationarity conditions 



5e 



dx 



If the Legendre transform associated to ^ is a diffeomorphism, we can write ^ = and the 
equations take the form 



X - 



6h 



6[p,x] 



P-- 



6h 



P, 



S[p,x] 



+ 



de 

dx 



In the particular case where i is given by the kinetic energy of a bi-invariant inner product, 
one obtains the control ^= [p,x] and the double bracket equations 

x=[x,[p,x]], p=[p,[p,x]]. 



More generally, the Lagrangian £{^,x) = ^\\^\\^ — V{x) implies the motion equation ^ = 
see |H]. An interesting example is provided by the potential ^(x) = — 1|| |p; 
see [H] . For more discussion of the history, theoretical developments and other examples of 
double bracket equations, see, e.g., [28] . 
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The distributed optimization approach 

The penalty functional is defined on g x Tg and reads 

where the norm is associated to a inner product on the Lie algebra g. The associated 
equations of motion read 

dSi 5i di 1 , 
— — = ad. — + ad„ 7^ + ^ ad^ u . 

As above, we now suppose that q is endowed with a bi-invariant inner product 7 and we 
use it to identify the dual Lie algebra q* with q. In this case, the above equations become 



d Si 



Si 

Si 



+ 



di 
dx 



,x 



These equations are obtained from the stationarity conditions 

1 r T • r .1 . ^ .^ 2di 

— = —[v,x\, x = [x,i\ + v, v=[v,i\+a —. 
As usual, we define the variable p:— in order to rewrite these conditions as 

^ = x=[x,i]+a'^p, p=[p,^] + ^. 

As before, if the Legendre transform associated to £ is a diffeomorphism, we get 



X- 



X, 



Sh 

S[p,x] 



+ a p, p- 



Sh 



P, 



6[p,x]\ dx 



+ 



di 



If the Lagrangian i is given by the kinetic energy of a bi-invariant inner product, we get the 
control ^ = Now the double bracket equations are modified by an extra term: 

x=[x,[p,x]]+a'^p, p=\p,\p,x]]. 

Further investigation of this class of equations will be pursued in future research. 



8.2 Action by afRne representation 

We now consider the more general case where G acts on V* by a left affine representation, 
a\-^ ga + c{g) , where c : G ^ l^* is a group one-cocycle. In this case, the infinitesimal generator 
is 

ey*(a)=ea + dc(0 
and the cotangent bundle momentum map is 

J{a,v) = —voa + dc^{v). 

Affine representations play an important role for a comprehensive approach to the Hamilto- 
nian dynamics of complex fluids. Wc quickly give below the main equations arising in that 
case, in order to understand the influence of the cocycle. 
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The Clebsch optimal control approach for affine action 

The Clebsch optimal control problem (with condition (A)') yields the affine Euler-Poincare 
equations 

These equations appear naturally in the study of spin systems and complex fluids; see [27] . 



The distributed optimization approach for affine action 

The penalty function in the case of an affine representation is ||a + ^a + dc(^)p. The presence 
of the cocycle c does not modify the tensor fleld J-"^ and one flnds the motion equations 



lit 



As before, these equations can be obtained either by metamorphosis reduction, or by the 
stationarity conditions 

^ = i-(z.^a-dc^(z/^)), d = -^a-dc{0 + i^, u' = -^u' + a'^. 

Defining the variable p: = -^u^, we can write 

6i Si 
— =poa-dc^{p), d = -^a-dc(^) + cTV, P=-^p+— . 



When the affine term is not present, one recovers (8.7). If the Lagrangian i is given by the 
kinetic energy associated to an inner product, then the control is given by 

e = (poa-dc^(p))«, 

and we get the equations 

d+{poa-dc'^{p)fa + dc[{poa~dc^{p)f) = a'^p\ 

p+ (poa — dc'^(p))^p = 0. 



8.3 Actions by multiplication on Lie groups 

We now specialize to the case where Q = if is a Lie group, containing G as a subgroup. We 
will then apply the results to the rigid body and ideal fluids. 

Suppose that G acts on H by left (resp. right) multiplication. Given a Lie algebra 
element ^Gg, the inflnitesimal generator is 

^H{h)=TRf,^=:^h, resp. ^H{h) = TLf,^=:ht 

and the cotangent bundle momentum map 3:T*H^q* is 

3{ah)=i*(T*Rhah)=i*{ahh~^) resp 3{ah) =i*(T*Lhah) =i*{h~^ah), 

where i* : P)* — J-g* is the dual map to the Lie algebra inclusion z : g — )■ f). 
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The Clebsch optimal control approach 

Given a cost function i = i{^,h), and assuming the constraint (A)', that is, h = —^h (resp. h = 
—hC,), the Clebsch optimal control problem yields the (generalized) Euler-Poincare equations 

If the constraint (A) is assumed, that is h = C,h (resp. h = h^), the equations are 

dtrr'^'^'M''' \dh^~ '"'p- dtrr^'''^'6i^' v~ dh 

These equations are obtained by inserting the expression of the momentum map in equations 



(3.3) and (3.5). 



The distributed optimization approach 

The penalty functional is defined on g x TH and reads 

relative to a Riemannian metric on H. We will restrict to the case of an H -invariant metric. 
More precisely, given an inner product 7 on f), we consider the associated left (resp. right)- 
invariant Riemannian metric 7/1 on iJ, that is, we have ■jhiuh,Vh) ■ = 'y{h~^Uh,h~^Vh), resp. 
lh{uh,Vh) ■.= j{uhh-^,Vhh'^). 



Since G acts by isometries, the motion equation are given by (3.34) and (3.35). To com- 
pute these equations in our particular case, we need the concrete expression of the Levi-Civita 
connection associated to the Riemannian metric 'jh on H. It is written in terms of the iso- 
morphism ^fj : J^{H,\:)) ^X{H) given by ^{f){h)=TLhU{h)) (resp. ^(/)(/i) =m(/(/i))). 
For a vector field X G X(if), the Levi-Civita covariant derivative associated to the left (resp. 
right)-invariant extension of 7 to is given by 

V.,X{h) = TU [dfivu) - \ adt f{K)-\ ad;^,) v + \ \vj{h)\^ , 

resp. V,,X(/i) = Ti?, (^d/(t;,) + ^ adt /(/i) + \ ad}(,) v - \ \vj(h)\\j , 

where v:=h~^Vh (resp. v:=Vhh~^), f = ip~^{X), and ad^ is the transpose of ad^ with respect 
to the inner product 7 on f), see |19], Section 46.5. 

We now specialize these formulas to the case where the vector field X is given by the 
infinitesimal generator C,h- In the case of multiplication in the left, we have X{h) = C,H{h) =C,h 
and f{h) = Ad/j-i^. Thus we obtain 

V.,e/.(/i)=TL,(^-r'^^h,/W]-^adt/(/^)-^ad;(,)t; + ^[t;,/(/i)] 



1, 



- TLJ[v,f{h)]+^dlf{h) + ^d 
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For right-invariant metrics, we have C,H{h) = hC,, f{h)=AdhC, and the previous formula be- 
comes 



"^v.^hW = -TR, ( [v, f{h)] + adi f{h) + ad 



t , 



When condition (A)' is assumed, the motion equations are (see (3.35)) 

di 



resp. 



f d 6i 
J dtl^' 




1 ^ 








d 6i 
~dtJl~ 




D 









dh 



^-TL, {[u, f{h)] + ad}(,) u + adt /(/i)) e T,H 



(8.19) 



i* h 



dh 



, Uh = h + h^ 



^TRh (h f{h)] + ad}(,) u + adi /(/i)) G T,i7, 



(8.20) 



and the stationarity condition (3.25) is 
Si 1 



s^ 



a 



;i* [vfji ^), Uh = h + ^h, resp. 



Si 

Si 



If the constraint (A) is assumed, then we have (see (3.34)) 




■adj — + « 
Si 



di 



resp. 



2di^ 
Di'''-'' dh 



Si 



: - ^TL, {[uj{h)] + ad}(,) u + adt f(h)^e T.H 



(8.21) 



d Si 
dtSi 

D 

Dt'''-'' dh 



idi\ 



.di^ 



^TRf, {[u,f{h)] + ad}(^) u + adt /(/.)) ^ jf^H, 



(8.22) 



and the stationarity condition (3.25) is 

77 = -2** ('^^^"^) ' i^h = h-ih, resp. ^ = ^i* (/i" VJ^) , Ph= 'h-hi. 

From the general theory developed in Section [5| these equations can be obtained by metamor- 
phosis reduction, starting from the G-invariant Lagrangian L = L (jj,g,f,f^ ■ T{G x H) — j-M 
given by 

L{g,gjj)=C{g,gJ) + ^^\\f\\\ 

where £ : TG x if — )■ M is the G-invariant function associated to I. One can pass from the 
Lagrangian variables ((?,/) to the reduced variables {C,,i^h) via the map 



^g,gJJ)^ii,i'h)-=(g ^g,g V 

for example. Note the relation h = g~^f. 
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Remark 8.3. Note that if the inner product 7 on f) is bi-invariant, then ad^ = — ad and the 
equations above simphfy. ♦ 

8.3.1 The A^-dimensional rigid body 

We now apply the above results to the Lie groups G = SO{N) and H = GL{N) in order to 
obtain the distributed optimization approach for the iV-dimensional rigid body. Of course, 
the more interesting case happens for A^ = 3. We let SO{N) act on GL{N) by multiplication 
on the right. Given Q G GL{N) and U &5o{N), the associated infinitesimal generator is given 
by Ugl{n){Q) = QU ■ The Lagrangian of the rigid body is of the form i{U) = ^{J'{U),U), 
where j7':so(A^) -^5o{N) is a symmetric positive definite operator of the form 

j(^U) = UJ + JU, 

where J is a diagonal matrix verifying Jj + Jj>0 for all i^j. 

The Clebsch optimal control approach 

Using the constraint Q = QU, the Clebsch optimal control problem yields the motion equa- 
tions 

M=[M,U], M=^=^-JiU) 

of the A^-rigid body, where we identified the dual Lie algebra so{N)* with so{N) via the 
pairing {P,V) :=Tt{P^V). Of course, when A^ = 3 and identifying so(3) with (]R'^,x), we 
recover the classical Euler equations M = M x U. Using the same pairing as above to identify 
the tangent and cotangent spaces, we obtain the expression J{Q,P) = ^(^Q^P — P^Q) for 
the momentum map. This yields the stationarity conditions 

U = J-^[QTp-pTQ^^ Q = QU, P = PU. (8.23) 

The two last equations are referred to as the symmetric representation of the rigid body; see 
i, [?!, 0, and m- 

Note that one can also let SO{N) act on the vector space qI{N) instead of the group 
GL{N), with the same results. 

The distributed optimization approach 

The penalty term reads — Qf/|P, where the norm is associated to a Riemannian metric 
on GL{N), and one needs to minimize the functional 

\{JiU),U) + ^JQ-QUryt. 

If the Riemannian metric on GL{N) is ng'/ii-invariant, the associated stationary conditions 
and equations of motion can be computed with the help of the general formula derived 



above. In particular, one needs to use equation (8.22). Since the resulting equations are 
complicated, we do not pursue this approach here and leave it for the interested reader. 
This route will be taken for the ideal fluid equations below. 
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An alternative approach is to consider the action of SO{N) on the vector space qI{N) 



instead of the Lie group GL{N). In this case we can apply the results of ^8.1 and we suppose 
that the norm involved in the penalty term is associated to a inner product on qI{N), but is 
not necessarily right-invariant. The minimization problem is the same and one obtains the 
stationary conditions 



-J{U) = -u^oQ, Q = QU + u, 



where the diamond operator is given by PoQ = —3{Q,P) = —^(^Q^P — P^Qy Here, b: 
qI{N) ^ qI{N)* c::^ qI{N) is the flat isomorphism associated to the inner product on gl{N). 
Recall that the dual space 0t(A^)* is identified with 0t(A^) via the pairing Tt{P'^V), but the 
inner product can be different from this pairing. As usual, we define the variable 



P: 



and we rewrite the above conditions as 



U = J-'{Q^P-P^Q), Q = QU + a''P\ P = PU. 
These equations should be compared to the symmetric representation of the rigid body 



(8.23). The equations of motion for M take the form 

M=[M,U]-a'^PoPK 

Thus we get the system of equations 

M = [M,U]-a^PoP'^, 
P = PU, 
Q = QU + a^pi, 



(8.24) 



which is analogous to the system (8.2) for a right action and left reduction. Therefore 



this system is Lie-Poisson on the dual of the Lie algebra so{N)(§){Qi{N) xgl{N)), with a 



symplectic 2-cocycle on the latter product. See also (2.17) and (2.24) 



8.3.2 Euler fluid equations 

Hamilton's principle for ideal fluid flow might be summarized by saying that water moves 
as well as possible to get out of its own way [62]. The question pursued in [36] was whether 
Euler's fluid equations represent optimal control, or only optimization. As it turned out, 
the geodesic flow represented by the Euler's fluid equations was found to arise from either 
formulation. An optimization method used in image-processing (metamorphosis) is found to 
imply Euler's equations for incompressible flow of an inviscid fluid, without requiring that the 
Lagrangian particle labels exactly follow the flow lines of the Eulerian velocity vector field. 
That is, an optimal control formulation and an optimization formulation for incompressible 
ideal fluid flow both yield the same Euler fluid equations, although their Lagrangian parcel 
dynamics are different. This is a result of the gauge freedom in the definition of the fluid 
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pressure for an incompressible flow, in combination with the symmetry of fluid dynamics 
under relabeling of their Lagrangian coordinates. 

We apply here the result of this section to the Lie group H = DiS{V) of all diffeomor- 
phisms of the manifold V and its subgroup G = Diff„oi(P) of volume preserving diffeomor- 
phisms. We shall recover and extend the approach given in |36j. Recall that a curve 
rit&DiSyoi{1^) represents the Lagrangian motion of an ideal fluid in the domain V, that 
is, the curve rit{x) in V is the trajectory of the fluid particle located at x at time t = 0, 
assuming that r^o is the identity; rjt is referred to as the forward map. The Lie algebra of 
G consists of divergence free vector flelds on V parallel to the boundary and is denoted 
by g = Xyoi{1^)- The curve 77^ is the flow of the Eulerian velocity Ut&XyoiiJ^), that is, we 
have r]t = Utor]t. The curve lt: = r]^^ is called the back-to-labels map. (See, e.g., [22] where 
the name "back-to-labels" was introduced and the map was used as a sufficient variable to 
describe and analyze the incompressible Euler equations.) The back-to-labels map is related 
to the Eulerian velocity Ut via the relation lt + TlfUt = 0. 

As is well known, a curve rjt e Diff^o;(X') is a geodesic with respect to the right invariant 
Riemannian metric if and only if ut is a solution of the Euler fluid equations 

dfU + u-Vu = — gradp. 

In other words, the Euler fluid equation is given by the Euler-Poincare equation on X^oiil^) 
associated to the Lagrangian i{u) = ^Jjj WuW^dx. 

First approach - composition on the left: We let the group G = DiS^oi{1^) act on H = DiS(T>) 
by composition on the left. The inflnitesimal generator is thus given by MDiff(i')(v^) = woi^j. 

The Clebsch optimal control approach 

Using the Lagrangian i{u,ip) = i{u) = ^ Jj^WnW^dx = ^\\u\\'j^2 and the constraint ip = uoip, the 
Clebsch optimal control problem yields the Euler equations 

dtu + V„-u = — gradp. 

Note that here there is no dependence of C. on the variable if, therefore the Clebsch approach 
yields the standard Euler-Poincare equations. The stationarity conditions are 

m'' = J(V9,7r) =P(J(/9^"^(7rO(^^-'-)), (p = UOip^ TT = —{T*UOlf) -TT, (8.25) 

where P: fl^{V) -^fl\^^{V) is the Hodge projector and J(p is the Jacobian determinant of (p. 
Here we have chosen the pairing between one-forms and vector flelds on V and b denotes 
the index lowering operation deflned by the Riemannian metric on V. 

The distributed optimization approach 

The penalty term is — woy^H^a, where the norm is taken with respect to the /e/i-invariant 
metric on Diff(D), and one needs to minimize the functional 
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among all curves u{t),(f{t) in Xdiv{1^) x Diff(I'). The stationarity condition (3.25) reads 

where F : fl^ (V) ^ Q\-^(V) is the Hodge projector onto divergence free forms. In our case, 
the equations of motion are given by (8.21) and we get 



(9tM + V„n = -gradp, (p = uo(p + i^^ 
D 



Dt 



(8.26) 



where h':=Tip ^ou^ and J-'(t',z/) = ^ (grad(y'(f ,z/) + z/divt> + t>divz/). To see this, we compute 
the right hand side 

-iTL,(h /(/.)] +adt(,)Z. + adt /(/.)) 
of the second equation in (8.21). We have 

= gradf/ (f , z/) + 2 V^i^ + div w + div J/ 

since ad]jm = VM'^m + V„m + mdivu. By choosing iy:=TL^-i(iy^) =T(p~^ ou^p and v: = 
Ad^-iu = T(p''^ ouo(p = (p*u, we obtain the result. Note that ip*u is an analogue of the 
convective velocity, but recall that (p is not the flow of u. 

As usual, the stationarity conditions can also be expressed in terms of the variable 7r: = 
^z/^. They can alternatively be written as 

= 3{(p,'iT) = F{Jip^^{'n-oif^^))^ (y9 = 'UO(^ + (T^7r', TV = —{T*uoif) -Tt + a'^S{7r), 



in order to be compared to (8.25), where S denotes the geodesic spray of the left invariant 
Riemannian metric. Here tj:=t'^^. 

The equations (8.26) can be obtained by metamorphosis reduction for the Lagrangian 
defined on G T(Diff^o,(P) x Diff(P)) by 



+ 



2a2 



2 

L2) 



where the norms are associated to the right and left invariant extension of the inner 
product, respectively. This Lagrangian is invariant under the tangent lift of the right Diff^o;- 
action given by 

The link between the Lagrangian variables {f],Ti,f,f) and the reduced variables (m,z^<^) is 
given by the reduction map 

In particular, we have ip = rio f. 
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Note that the operator D/Dt denotes the covariant derivative with respect to the left- 
invariant Riemannian metric on Diff^o;(D) and does not have a simple expression, in 
general, contrary to the covariant derivative associated to the right-invariant Riemannian 
metric, which is simply given by functorial lift. As we will see below, for the penalty approach 
to the Euler equations, it is more convenient to work with the back-to-labels map. 

Note that instead of iJ = Diff(P), one can use the subgroup if = Difft,o;(P) of volume 



preserving diffeomorphisms. In this case, the second equation in (8.26) simplifies to 

i/^ + T9? o V<^.„i^ = 0. 

Second approach - composition on the right: We now let the group G = DiS^ai{1^) act on H = 
Diff(T') by composition on the right. The infinitesimal generator is thus given by UDiff{D)(0 = 
Tlou. Recall that the back-to-label map / is related to the Eulerian velocity u by the formula 
1 = —Tlou; therefore, we need to impose condition (A)'. 

Clebsch optimal control approach 

Using the same Lagrangian i{u) = ^ Jj^WuW^dx as before, and imposing the condition / = 
—Tlou, (condition (A)'), the Clebsch optimal control problem yields the Euler-Poincare 
equations on Xyoi(T>). We thus recover the Euler fluid equations 

dtu + u- Vm = — gradp. 

The associated stationarity conditions are, |36i 

u^ = -F{noTl), i = -Tlou, 'k = -TiTou. (8.27) 



In analogy with equations (8.23) for the rigid body, these equations are referred to as the 



symmetric representation of the Euler fluid equations. 



The distributed optimization approach 



The penalty term reads ||Z + TZoti||^2 where the norm is taken relative to the right-invariant 
metric on the group of diffeomorphisms. Therefore, we minimize the functional 



f (^ll"lli. + ^ll'+r/o.||i,)*, 



among all curves u{t),l{t) in XdiviT^) x Diff(D). The stationarity condition (3.25) reads 



^ = -lpWoT/) = -l.NT/-dfc, 
where vi:=l + Tlou and the associated equations of motion are 



dtu + V„M = — gradp, 

D (8.28) 
—yi + Vui'i = -{gr&dq)ol. 
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These equations are obtained by computations similar to those above, but using (8.20) 



instead of (8.21). In particular, the right hand side of the second equation of (8.20) becomes 



-(Vi,i/)o/- (gradg)o/ = -V„i/i- (gradg)o/, q = ^g [l^u,u^ol ^) , 

since we need to choose v:=vio and v = Adiu = Uu. Note that v is the convective velocity 
of the fluid. As usual, the stationarity conditions can also be expressed in terms of the 
variable n := -^u^i. They can alternatively be written as 



u 



-J(/,7r) = -P(7roT/), / = -TZoM + aV, n = -Tn ou + a^S{7i 



in order to be compared to (8.27), where S denotes the geodesic spray of the right invariant 
Riemannian metric. 



The equations of motion (8.28) can be obtained by metamorphosis reduction of the 



Lagrangian defined on {urj,Uf) G T(Diff^oi('D) x Diff(X')) by 



1,, ,, 1 

-\\Ur,\\L2 + 



2(t2 



2 

L2) 



where the norms are associated to the right invariant extension of the inner product. 
This Lagrangian is invariant under the tangent lift of the right Diff^oraction given by 

The link between the Lagrangian variables {f].,'r],f,f) and the reduced variables {u,i>i) is given 
by the reduction map 

In particular, we have l = for]~^. 

Working with H = Diff„oi(P) instead of the whole diffeomorphism group, yields the second 



equation of (8.28) in the simpler form 



D 
Dt 



J^i + V„z/i = 0. 



These results recover Theorem 10 in 



8.3.3 Optimization dynamics of a compressible fluid 

For the compressible fluid, we choose to minimize the functional 

T 

S,= I (^£(u,p) + ^||/ + T/oM||22 + ^||p + div(pu)f 



over all curves u{t),l{t),p{t) in x Diff (D) x Dens(P). This minimization involves penal- 
ties and tolerances at two levels. We seek the stationarity conditions implied by optimization 
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of the functional Sd, subject to homogeneous endpoint and boundary conditions. We intro- 
duce the notation, 



ou op 

n:=\{i + Tlou)^ eT*Bm{V), 0: = 
^1 

then write the stationarity conditions as: 



a' 



(p + divpw) eC°°(r'), 



(8.29) 
(8.30) 



5u 
51 
6p 



m + 7T oTl — pd(j) = 0] 
7r + div(7r'u) = 0; 
(f) + d(j)ou — w = 0. 



(8.31) 



Combining these equations into Hamihonian form yields (in index notation for clarity) ex- 
plicitly, in terms of indices and differential operators, 



di 



mi 




6hM/Sm,j 


= u^ 


P 




5hM/Sp = 


—w 





= -B 


6hM/S4> = 


0-10 








= 






5hM 15713 = 





where 



E- 



'mjdi + djmi pdi -0, 
djp -1 
1 





dj7!-A 



B 




-1 

1 



(8.32) 



(8.33) 



Here, the summation convention is enforced on repeated indices. Upper Latin indices refer to 
the spatial components of the inverse map, lower Latin indices refer to the spatial reference 
frame, and subscript-comma notation is used for spatial derivatives. The partial derivative 
dj = d/dxj, say, acts to the right on all terms in a product by the chain rule. The Hamiltonian 
whose variations are taken in (8.32) is given by 



hMim, p,(j),l, 7r) = h{m,p) + y - 



ax 



8.4 A^-dimensional Camassa-Holm equation 

In this section we apply the distributed optimization method to the A^-dimensional Camassa- 
Holm equations 

V + u-Vv + Vu^ -v + vdivu = , v := (l — a"^ A)u , 

which are the spatial representation of the geodesic spray on the group Diff(D) of all dif- 
feomorphisms of V, relative to a Sobolev metric; see [39] • They are thus obtained by 
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Euler-Poincare reduction and represent a particular case of the well known EPDiff equations, 
to which the approach described here generalizes easily. For simplicity, we assume that V 
has no boundary. 

By analogy with the Euler equations, we shall give two approaches, namely, by compo- 
sition on the left and on the right. However, in the case of the Camassa-Holm equations it 
is convenient to slightly generalize the previous setting by letting the diffeomorphism group 
act on a space of embeddings. More precisely, we first consider the left action of DifF(T') 
on the space of embeddings Emb{S,V) of a manifold S into V and obtain the distributed 
optimization for the cost function 

^^Q||«IIhi + ^IIQ-«°QII')^^, QeEmh{S,v). 

Then, we let Diff('D) acts on the right on the space of embeddings Emb(P,M) of a manifold 
P into a manifold M and obtain the cost function 

IWuWm + ^H + Tqouf^dt, qeEmb(P,M). 

8.4.1 Left action of diffeomorphisms on embedded subspaces 

Consider the left action of the configuration diffeomorphism group G = Diff(I?) on Q = 
Emb(5',P). The infinitesimal generator associated to a Lie algebra element uGj£(P) reads 
UEmh{s,v){Q) = uoCl and belongs to the tangent space TQEmb(S',T'). 

The Clebsch optimal control approach 

Using the Lagrangian £{u,Q) =(i{u) = \ j-pWuW^idx and the constraint Q = mo Q, the Clebsch 
optimal control problem yields the A^-Camassa-Holm equation; see Section 4 in [28]. Note 
that here there is no dependence of £ on the variable Q, therefore the Clebsch approach 
yields the standard Euler-Poincare equations. The stationarity conditions are 



5u 



■■3{Q,P)= [ P{s)6{x-Q{s))dsen\V), Q = MoQ, P = -(r*MoQ) -P. 
Js 



The last equation can also be written as 

^p=-((v«roQ).p, 

where D/Dt denotes the covariant derivative associated to the Riemannian metric on V. 

The distributed optimization approach 

The proposed associated cost function is 

Sd= ! {£{u) + ^\\(:i-uoCi\\l,yt. (8.34) 
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For definiteness, we rewrite this expression more explicitly as 

Sd= [ + ^ /"|Q(t,s)-n(t,Q(t,s))prfs)rft, (8.35) 

in which, for simplicity, | ■ p denotes the norm of vectors in TV defined by the Riemannian 
metric on V and ds denotes the volume form on S. There could also be a sum on integrations 
over some finite number of embedded submanifolds of various dimensions, but this possibility 
is unimportant in the subsequent reasoning, so it will be suppressed in the notation. 

The choice of the reduced Lagrangian i{u) will be left unspecified, except that sufficient 
smoothness will be assumed for the variational calculations manipulations to make math- 
ematical sense, at least in terms of weak solutions. With these assumptions we have the 
following result. 



Theorem 8.4. The extremals of Sd in (8.35) are given by 



P{t,s)6{x-Q{t,s))ds, Q = MoQ + a2ptt, p = - (^(Vm)^oQ) -P, (8.36) 

^ s 



where Q G Emb(S',r'), P^ G TQEmb(S',r'), and D/Dt is the covariant derivative of the Levi- 
Civita connection on T>. 



Proof. We can obtain these conditions directly from the general equations (3.28). However, 
it is also instructive to derive them directly from the variational principle. 
Consider the variations e^u^ and e i— )■ and define P" by 



cT^P":=Q-MoQ. 



For 5u - 
SSd-- 



rfiL=o"^ and 6Q= d-e\s=o^e, we have 
dt 



T 







T 



JS 



P(t,s), 



dt + 



D 

De 

T 



Q,(t,s)-5n(t,Q(t,s))- 



D 



e=0 



JS 



P(t,s), 



D d 
Dt de 



De 

Q,£{t,s) )dsdt 



u{t,Q,fr{t,s)) jdsdt 

e=0 I 



£=0 







5u 



[ [ {P{s)6{x-Cl{t,s)),6u{t,x))dsdxdt- [ [ {P{t,s),VsQu{t,s))dsdt 
JvJs Jo Js 

Pis)5{x -Clit,s))ds,5u^ dt- ^ ^-^P + {(VufoQ) . P,5Q^ dt 



+ 



J 



The stationarity conditions follow immediately, upon noting that (5Q(0,s) = = (5Q(T,x), so 
that temporal endpoint terms arising under integrations by parts may be ignored. ■ 
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Suppose the reduced Lagrangian defines a velocity norm, £{u) = ^WuW^ = ^{u,Qop{u)). 
For example, let the norm be a Sobolev norm, so that it makes sense for its variational 
derivative in u to result in a singular distribution defined on an embedded subspace. Then, 
the density equation 



— ( 
6u 



f 

-(t,x)= / P(t,s)S{x — Q(t,s))ds=:{Qopu)(t,x) 
'■ Js 

has a natural dual solution for the velocity, given by 



(8.37) 



u{t,x) 



P\t,s)G{x-Q{t,s))ds 



where G is the Green's function for the positive self-adjoint operator Qop, that is 

QopG{x-Q{t,s)) = 6ix-Qit,s)). 



(8.38) 



(8.39) 



In this situation, we have enough assumptions to obtain a coupled system of equations for 
the momentum densities P(t,s) and m{t,x). 



Theorem 8.5. The system of variational equations (8.36) for the minima of S in (8.35) 



implies the following dynamics for the momentum densities P{t,s) and m{t^x), 



dtV + WuV + Wu-v + vdiv{u) = -a'Div / P»(g)P»(t,s)(5(x-Q(t,s))cis 

Js 

D 

Dt' 



-P+((Vm)^oQ).P = 0, 



(8.40) 
(8.41) 



where Div denotes the divergence of a contravariant two-tensor field on T). The remaining 
decoupled equation 

Q = noQ + (T2ptt 

allows reconstruction of the Lagrangian coordinates Q(t,s) on the embedded surface(s) from 
the dynamics of the coupled equations for the momentum densities m{t,x) and P{t,s). 



Proof. Substitution of equations (8.36) and definitions (8.38)-(8.39) into the definition of 



the momentum m in equation (8.37) verifies its evolution by (8.40), upon pairing with a 



smooth test function and integrating appropriately by parts. 

Alternatively, one can use the abstract formulation of the dynamical equations given in 



(3.32). As recalled before, the Euler-Poincare part of these equations gives the A^-Camassa- 



Holm equation 

+ V„f + Vu^-t> + t>div(n) = 0. 

Thus, it remains to compute the expression of the tensor J^^. Let P^ eTQEmb{S,V) and 
u G ^(T>), and choose X G X(V) such that P^{s) =X(Q(s)). Using the fact that the covariant 



derivative on Emb(S',P) is the functorial lift of the covariant derivative on D, using (3.31) 
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we get 



(j-^(P,P«),^z) = (P,Vp«ME^b(S,75)(Q)>= / (?(P«(s),Vp«(,)M(Q(s)))rfs 

Js 

= g{X{x),Vx(x)u{x))6{x-Q{s))dxds 
J sJv 

= - j g(^j Div {X{x)®X{x)5{x-Cl{s))),u{x)^dxds, 

where we make use of the identity 

[ g{X,VYu)dx = - [ g(Biv(Y®X),u)dx, for all X,F,MGX(r'), 
Jv Jv 

where Div(T)-' = VjT*-' , where T = T^^ ^ is a contravariant two-tensor on V. We thus 
obtain the formula 



J^^('P,P" 



- ! Div(X(x)®X(x)5(a;-Q(s)))rfs 
Js 

-Div f X{Cl{s))®X{Cl{s))5{x-Cl{s))ds 
Js 

-Div f (P«(s)®P«(s)(5(x-Q(s)))c/s 



5 



as required. 



Remark 8.6. Equations (8.40) and (8.41) represent a new dynamical system, whose explo- 



ration has only just begun and we expect will be a subject of future research. ♦ 
8.4.2 Back-to-labels map for fluids 

We next present the optimal control derivation of the Camassa-Holm equation using the 
back-to-labels map. This means that we shall use the right action of Diff (1)) on Emb(T',M). 

Recall that particles frozen into an ideal fluid flow are represented by time-dependent 
vector labels It whose components each satisfy the advection law obtained from the time 
derivative of the back-to-labels map, lt{x) ■. = r]^^(x) = l{t,x), and hence it satisfies the equation 

i + Tlou = 0, (8.42) 

where u is the Eulerian velocity of the fluid. 

We shall slightly generalize the back-to-labels map by considering embeddings q : P — )■ M, 
where M is a given Riemannian manifold, instead of diffeomorphisms l:T) ^T). 

The Clebsch approach 

We recall from [28j how one can obtain the Camassa-Holm equation by Clebsch optimal 
control via a generalization of the back-to-labels map. 
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Let the group G = Diff(D) acts freely on the right on the manifold Emb(P,M). The 
associated infinitesimal generator reads UEmb{v,M)iQ) = Tqou. Using the Lagrangian £{u) = 
and the constraint q + TqoM = we get the stationarity conditions 

Si 

— = — p-Tq, q + TqoM = 0, p + rpo'u = 0. 
ou 

These equations produce the Camassa-Holm equation if one uses the Hamiltonian 
iJ(q,p) = - J J p{x)-Tq^(x)G{x — x')p{x')-Tq(x')dxdx' . 

Distributed optimization 

As opposed the Clebsch approach, we do not impose qt + Tqtou( = 0. Instead we use ||qt + 
TqfOM^||^2 as a penalty, that is, we consider the cost functional given by 



T 

5'^^= / {i(u 



+ ^ Jq + Tqou\\l,) dt. (8.43) 
° Penalty 

Thus we need to minimize 5*^ subject to spatial boundary conditions, endpoint conditions 
(q(0,x) and q(T,a;) are prescribed), and penalize for the error in the norm, 

\\il + Tq_ou\\l2= / \q{x)+Tq{u{x))fdx (8.44) 

in which, for simplicity, | ■ p denotes the norm of vectors in TM defined by a Riemannian 
metric on M. It is important to note that the Riemannian metric used in the penalty is 
not invariant under the right Diff (T')-action on itself. 

Remark 8.7. (An alternative penalty term) If M = V, the quantity 

v:= —lol~^ =Tlouol^^ = l^u = Ad;ti 



is called the convective velocity [40] of the fluid. This is analogous to the relation Q = Adp-^uj 
for 6 5*0(3) satisfied by body angular velocity Q and spatial angular velocity u for rigid 



body motion in M^, both viewed as elements of so(3). Penalizing in (8.43) for Ht; — Ad/M||^2 



is an interesting alternative approach, which will be presented, in general, in ^8.5.1 



Let a^>0 and choose the reduced Lagrangian to be a norm £(«) = Then, when 



extremals of (8.43) exist, they will be minima. 

Later we shall specialize the reduced Lagrangian to the norm £(m) = ^HmH^i. For the 
moment, however, we leave the choice arbitrary, only assuming that sufficient smoothness is 
present for all functions to exist locally and be differentiable in space and time. With these 
assumptions we have the following result. 
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Theorem 8.8. The extremals of Sd in (8.43) are given by 

- — |-7roTq = 0, — — + Div(7rM) = 0, q + Tqow =: a^vr", 
ou Dt 

where the expression Div(7r'u) G T*Emb(I',M) is defined by 



145) 



Div(7rn) := (divM)7r + VuTT, with Vu^ 



D 



vr(c(.)) 



e=0 



De 

for Ux eTxV, ei-^c{e) a curve such that ■^\^^^c{e) =Ux, and ^ is the covariant derivative 
of the Levi-Civita connection of the Riemannian metric on M . 

Note that if M = endowed with the Riemannian metric given by the dot product then 
Div(7rM)j = div(7rj'u). 

Proof. Define vr G TqEmb(I^,M) by cr^vr := q + Tqo-u. For variations e^Us and eh-j-qe, we 
compute 



ou )dt+ I \ ^ 



6u I 
\6u 
Su 



(qe + TqeOUe 



e=0 



T 



,5uJ dt- 
+ 7roTq,(5n 



D 



n,—6q + Tqo5u + Vu5q 



T 



D 
Dt 



7r + Div(7rM),5q 



where in the last equality, we used integration by parts and the definition of Div. 



Theorem 8.9. The system of variational equations (8.45) for the minima of Sd yields the 
following dynamical system for the momentum n and momentum 1-form := 5i/5u = —tc o 



dtV + VuV + Vu'^-v + vdiv{u) = a^{Vn)'^ -tt^ 
dt'iT + Div (ttu) = 0, 



(8.46) 
(8.47) 



where Div(7r'u) is defined above. The decoupled equation cr^vr" = q + Tqo-u allows reconstruc- 
tion of the labels q from the dynamics of the coupled equations for v and it. 

Proof. One can directly obtain these equations from the stationarity condition given in 
(8.45). We shall however use the abstract formulation (3.33) and compute the tensor field 
J"^ defined in ( |3.3lD . Given n eT*Emh{V,M), ueX{V), and a curve q^ G Emb(P,M) 
such that ^ l^^^q^ = tt", we have 



(j^^(7r,7r*),M) = (7r,V^«MEmb(©,M)(q))=y" g(^T^\x), ^ 

g (vr^ (x) , V„7r (x)) dx = { ( Vvr^) -ir^u) 



Tqe(M(x)) J dx 



e=0 



V 



which proves (8.46). Equation (8.47) is part of the system (8.45). 
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Remark 8.10. (Two-component Camassa-Holm equation) If D = M, M = ]R, and we assume 
appropriate decay properties at infinity such that all boundary terms appearing in integration 
by parts vanish, specializing the reduced Lagrangian to 



e{u) 



1 



u 



|2 



with homogeneous boundary conditions on the infinite real line or on a periodic spatial 
interval, yields variational derivative {5l/5u)^ = v = u — a^Uxx^ for a length scale a. In this 



case, equations (8.46), (8.47) recover the two- component Camassa-Holm equations, 



dtV + (UV):^ + VUx = (j'^TlTTx , 



dtTT + (uTt) 



0. 



(8.48) 
(8.49) 



This system forms a completely integrable Hamiltonian system with soliton solutions asso- 
ciated to an isospectral linear eigenvalue problem, so it may be solved analytically by using 
the inverse scattering transform method [211 ED]- These equations are also known to be the 
spatial representation of geodesies on the semidirect product Diff (]R)(s)J-'(]R); see jl5], [30] . 



8.5 Metamorphosis dynamics 

Consider a Lie group G acting on the left on a manifold N. The Lie group G is the group of 
deformations and the manifold N contains what are called "deformable objects" . In imaging 
applications we take G to be the group of diffeomorphisms of N. 



Definition 8.11. A metamorphosis ([66^ 116]) is a pair of curves [gt, rjt) EG x N parame- 
terized by time t, with go = id. Its image is the curve UtEN defined by the action nt = gtr]t 
denoted by concatenation from the left. The quantities gt and rjt are called, respectively, the 
deformation part of the metamorphosis, and its template part. When rjt is constant, 
the metamorphosis is said to be a pure deformation. In the general case, the image is a 
combination of a deformation and template variation. 



A metamorphosis may be determined as an optimal curve {gtirjt) with gtEG and rjtEN 
with respect to a metric that is invariant under the right action of G on G x N defined by 

{g,r])h={gh,h-'r]) (8.50) 

for any g,hEG and rjEN. More specifically, a metamorphosis {g,ri) may be obtained by 
seeking a stationary point 6S = of a right-invariant cost function S on T{GxN). This 
general situation has been considered in detail in the first sections of the paper. 



The present conventions are those of equations (3.32) with the upper sign chosen. Recall 



in particular, that we start from a right G-invariant Lagrangian of the form 

1 



L{,g,g,r],rii) = C{g,g,vi) + — \\gf] 



|2 



2(T 
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where the norm involved in the penalty is associated to a Riemannian metric g on A^. The 
corresponding reduced Lagrangians on x TN read 

(-Ep{u,Vn) ^i{u,n) + ^lli^nir, (-M{u,n,h) ^£{u,n) + ^||n-iiAr(n)||^, 
where the reduced variables are 

u = gg~^eQ, n = g7]eN, Pn = gf]eTnN 

with geG, rjeN. 



8.5.1 Subgroup actions 



We shall discuss in this paragraph the particular case in which is also a Lie group that 
contains G as subgroup and on which G acts by multiplication on the left. We also assume 
that the Riemannian metric g on N is left invariant (relative to left translations by elements 
of N). In this case, one can make use of left trivialization of the tangent bundle TN to get 
the diffeomorphism 

Q X TN ^QX N xn, {u,n,n) {u,n,n~^n) =: {u,nX)- 

The reduced Lagrangian in terms of the new variables is denoted £l and reads 

eL{u,n,C)^iiu,n) + ^\\C-Adn-iuf 

since we have the relations 

n~^i'n — n~^{h — UN{n)) —n~^{h — un) —C^—Kdn-iu. 

We now rewrite the stationarity conditions relative to these new variables. Consider varia- 
tions e^Ue and e^rie of the curves u and n. We have as usual 



(8.51) 



where E = n 5n. Likewise, 



(5(Ad„-i'u) = Ad„-i {5u + \u,5nn ^]) = Ad„-i {5u+ [M,Ad„E]) 

= Adn-i5u + [Ad„-i -u, S] . 

For simplicity, we suppose that ^ docs not depend on n. Substituting these relations into 
the variation of the action integral we get 



dt 



6e_ 

Si 

6i 
6u 



5u) + {'K,SC-S{Adn-iu)) ]dt 



,6u 



+ (^TT, E + ad^E - Adr,-i5u - ad^^j^_^^^ E 



dt 



- Ad^-iTT, 5u ) - \7r - ad^TT + ad^^^^_^^-j tt, E H + ^ (tt, E) 
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where vr G n* is the image momentum dual to the left-invariant image velocity C&n, that is, 

Stationarity 55" = and S(0) = S(T) = then implies 
6i 

— = Ad*-i7r and 7r = ad^7r — ad*^^ _^„)7r = ad*2^tt7r = (T^ad*B7r. (8.52) 

From the general theory, since the G-action on N is by isometries it follows that J-"^ = 0, 
and thus these equations imply the Euler-Poincare equations. It is also instructive to obtain 
them directly. Taking the time derivative and using general results relating the Ad* and ad* 
operations yields 

= Ad*-i ^77 — ad^vrj (with ( = n~^ri^ 
by (p2]3) = -Ad:-.ad*(,,^^ ^^.^vr 

= -ad:(Ad;-i7r) 



by ([8:52|i) = -ad:-. 



In turn, using u = gg ^ and n = gr], from the Euler-Poincare equation we get the conservation 
law. 



d 

dt 



(Ad*-i7r) = Ad*-i (tt — ad*7r) 



where v: = ri ^t) is the left-invariant template velocity. 

Remark 8.12. (Interpretation of the equations) 

1. The conservation laws for Ad*(5£/(5M) and Ad*_i7r provide the interpretations of the 
momentum dynamics. Namely, the momentum 51 /5u (resp. tt) undergoes coadjoint 
motion with respect to g (resp. ri~^). 



2. The peculiar form of the momentum equation (8.52 d) is then understood, because the 
template velocity v is proportional to image momentum tt by a factor of the penalty 
constant, which also maps it from the dual of the Lie algebra, back to Lie algebra, 
namely, 

v:= ri^^r] = rT^ Uri = c^vr". 

Perhaps not unexpectedly, when — )■ the template velocity vanishes and the re- 
maining image motion reduces to a pure deformation governed by the Euler-Poincare 
equation. 
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3. The metamorphosis {gt,f]t) is determined as an initial value problem, as follows. Given 
the Lagrangian £{u), the Euler-Poincare equation 

il^ + ad* — -0 
dt 6u ^ 5u 

determines the velocity u = gg~^ which then yields gt by reconstruction from solving 
gt = Utgt- Next, the relations 

7r = ad*^^_^^ _^^^|7^ and aV = C-Ad„-iu, 

with C = n~^n and n = un + Vn need to be negotiated to obtain the image curve n^. 
Finally, the template curve is obtained from rft = g^^nt. This process is worth discussing 
in an example. ♦ 

8.5.2 Example: Metamorphosis equations on SE(2) 

In SE{2) the manifold of "deformable objects" N = M.'^ is acted upon by the Lie group of 
"deformations" G = S0{2) on the left. The situation simplifies in this case because iV is a 
vector space and we recover the setting described in §8.1[ Hence, 



iM{u,n,u) = i{u) + ^\\u\\^ = i{u) + ^\\h - un\\'^ = iLp{u,n,h) 
and the cost function becomes 

Sd = £ (^i{u) + ^\\i^f'^dt = £ (^i(u) + ^\\h-unf'^ dt, 

where the se(2) Lie algebra action un may be written on as a cross product of vectors 

un = uzx n. 



Consequently, the SDP metamorphosis equations (see (8.7)) 

5i 

hvron = 0, 



Su 

TT — UTT = 



h — un = cr^vr" = i/, 



may be written in vector form as 

Si 



z + TT xn = 0, 



6u 

TC — UZ X TT = 



h — UZXn = (T^TT = u. 



A few statements may be made about the qualitative properties of the solutions of this 
system. 
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1. We first observe that |7r| is constant because by the second equation above, we have 

Ivrp = 27r • 77 = 27r ■ {uz x vr) = 0. So vr executes circular motion in the plane at constant 
rotation frequency vr x 7r/|7rp = -uz. 

2. Substituting the second and third equations into the time derivative of the first one 
yields the conservation law, 

d Si _^ 
dt 6u ' 

for the planar motion. In particular, we obtain the constant of motion vr x n = const. 

3. The other two equations are closed provided one may solve 6i/6u for u, which of 
course we shall assume is possible. More precisely, we now assume that the Legendre 
transformation u^6i/6u is a diffeomorphism. In this case, since Si/6u is constant, u 
is also constant. 

4. It remains to determine the effects of 7^ on the dynamics of n. A short computation 
shows that: ^ ^ 

— (tt ■ = (j^IttP and — |nP = 2cr^(7r-n), 
dt^ ^ ' ' dr ^ ^ ^ 

so, since |vrp= const, increases quadratically with scaled time a'^t and the mo- 

tion may be visualized as taking place in with coordinates {xi,X2,X3) = (|7rp, |r;,p,7r- 
n) along the parabolas formed by intersections of level sets of the two integrals of 
motion 177^= constant and |7rxnp= const. The rotation frequency of n is found as 

nxh f a"^ 51 

■-Z u + 



np V InP^M 



As a^t— T-oo, the directions of the vectors vr and n tend toward a state of alignment, 
rotating together at frequency uz. In contrast, for cr^ = 0, the vectors vr and n keep their 
magnitudes and rotate together at frequency uz with constant relative orientation. 

8.5.3 Lie-Poisson Hamiltonian formulation of metamorphosis for right action 



In this example we particularize the system of motion equations (4.8) to the case of a 
representation but without imposing the endpoint condition at t = 1. The resulting equations 
are obtained by metamorphosis reduction from an arbitrary Lagrangian L : T(G x 1^) — j-M, 
where is a vector space. Thus, the equations below are more general that those obtained 
in the penalty approach. 

As explained in Section [sj the variational problem optimizes over metamorphoses {gt,i]t) 
by minimizing 5* = Ldt, for a Lagrangian L of the form 

L{9t,9t,Vt,Vt) = ^9t,9t,Vt) + ^WgtVtf , 

with fixed boundary conditions for the initial and final images no and ni, with image nt = 9tf]t 
for template rjt and go = 16.0', thus only the images are constrained at the endpoints. 

For the concrete metamorphosis example, the group G of diffeomorphisms Diff(P) 3 9 of 
the domain V is taken to act on the space of smooth maps (images) V = T{T>) 31] by the left 
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action grj:=rjog ^ of G on V. Therefore, the right action (8.50) of G on G x is given in 



this case by {g,ri)h:={goh,rioh) for g,hGDiS{V) and ri^J^(T>). The reduced Lagrangians 
C-Lpiut^ritjht) and iM{ut,nt,i't) are defined on the space qxV xV. In imaging apphcations, 
Ut = gtgt^ is the velocity along the optimal path gt sought between two images; nt: = gtrjt is 
the path in the image space; and vt ■=gtVt is the image velocity. 

From a visual point of view, image metamorphoses are similar to what is usually called 
"morphing" in computer graphics. The evolution of the image over time, t^rit, is a combi- 
nation of deformations and image intensity variation. Algorithms and experimental results 
for the solution of the boundary value problem (minimize the time-integrated Lagrangian 
between two images) can be found in [601 [25] . 



From the general metamorphosis equations (4.8) (with the minus sign corresponding to 



the right action of G on G xV) we obtain the dynamical system 



)Im , 61m 
ou on 



'M 



6v 



oi/t = 0. 



d 61 



M 



61m 6i 



dt 6iy 



-Uf 



M 



6^ 6n 



(8.53) 



ht = vt + utnt, 



6i 



M 



,1) + ^^(1)0^^1 



0, 



6u 6v 

where for n&V, aG V*, and X{V) =q, the infinitesimal actions and the diamond opera- 
tors are given by 

un = —dn -uEV = J^iV) , 
ua = div(aM) eV* = T{V)* = J^{V) , 
noa = -adneQ* = Q}{V). 

Even though we fixed the standard volume form on D C M" so densities on V are identified 
with functions and one-form densities with one-forms, we recall that one should think of ua 
density and no a as a one-form density. 
In contrast to earlier sections, fixed endpoints at t = 1 are not assumed in metamorphosis. 



This difference leads to the last equation in the system (8.53). For details of the derivation 



of the system (8.53) and discussions of the regularity of its solutions, see 



System (8.53) describes coadjoint motion 

d_ 
dt 



61 



M 



6i 



M 



6u 6v 



■on 



54./ , 6(.M 
ou ou 



or, equivalent ly, 



so that 



Ad 



9t 



6iM ^ 6iM 

6u 5z/ 



^ 6iM 

6u 6iy 



--Ad*-i 

9t 



6£ 



M 



+ ■ 



-M 



6u 6^ 



on 



(8.54) 



155) 



(8.56) 



t=o 



for the coadjoint action of the Lie group G on the dual of its Lie algebra q. 
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Hamiltonian formulation 



One passes from the Euler-Poincare metamorphosis equations on the Lagrangian side to 
their Lie-Poisson Hamiltonian formulation via the Legendre transformation; see the 
presentation and general formulas at the end of Section [6) The Legendre transformation 
of the reduced Lagrangian iMiUjn,^) : qxV xV ^M. in its variables u and u defines the 
Hamiltonian, 

h{iJ,,n,(3) = {fi,u) + {(3,u)-iM{u,n,u), (8.57) 



on g* xV xV*, where 



'M 



and (3 ■ 



5i 



M 



(8.58) 



6u 5z/ 

are given by the Legendre transformation. The variational derivatives of the Hamiltonian h 
are 

Jp^'^^ ir = — —■ (8-59) 



6h 
6fi 



6h 
6n 



Sn 



Consequently, the Euler-Poincare equations (8.53) for metamorphosis in the Eulerian de- 



scription imply the following equations, for the Legendre-transformed variables, (/i,n,/3), 
written as a matrix operation, symbolically as 



di 







adp/i 


-□on /3on" 




5h/5n 




Sh/6fj, 


n 




-□n 


-1 




6h/6n 


= :B 


Sh/6n 


P. 






1 




_6h/5p_ 




_6h/6f3_ 



i.60) 



with boxes □ indicating where the substitutions occur. These equations can also be obtained 



from the system (6.10) (with minus sign chosen in =p) by explicitly computing every term for 
this situation. The Poisson bracket defined by the skew-symmetric Hamiltonian matrix 
B is given by 



{/,/i}(/i,n,/3): 







T 


6h/6fi 




5f/Sn 


B 


6h/6n 




5f/SP 




_Sh/S(3_ 



dx. 



(8.Q1] 



The pair {n,P) satisfies canonical Poisson-bracket relations. The other parts of the Poisson 
bracket are linear in the variables (/i,n,/3). This linearity is the signature of the Lie-Poisson 
bracket on the dual of the semidirect product Lie algebra of vector fields X(V) acting on 
functions J-'{V,W) and its dual J^{V,W*) with a canonical cocycle between them. The 
semidirect product Lie algebra bracket on gxV xV is 

[{u,n,h'),{u,n,h')] = {[u,u],un — un,^ — uu) . 



A similar Lie-Poisson bracket was found for complex fiuids in [MJ . Ongoing work in this 
direction includes a Lagrange- Poincare formulation of these equations ( [29j ) . 



9 Conclusions and outlook 



This paper has begun the development of the family of dynamical systems associated with 
optimal control and optimization problems. The theory was developed in the context of many 
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examples inspired by control theory and optimization, particularly in the new area of appli- 
cations in imaging analysis of the theory of metamorphosis, a means of optimally tracking 
the changes of shape necessary for registration of images of various types, or data structures, 
without requiring that the transformations of shape be diffeomorphisms. The main idea was 
to soften the exact dynamical constraint by replacing it with a quadratic penalty term. The 
resulting optimization dynamics was studied by using methods that originated in geometric 
mechanics. In particular, Lagrange-Poincare reduction and its associated variational formu- 
lations were adapted to this sort of optimal inexact reduction. This approach allowed us to 
obtain the equations of metamorphosis dynamics that are naturally generated by the sta- 
tionarity conditions, then study their properties from both the Lagrangian and Hamiltonian 
points of view. 

This geometric setup for optimization dynamics was illustrated in diverse examples in 
Sectionjsj Besides metamorphosis ( ^8.5 ), these examples included optimally reduced versions 
of the heavy top (^8.1.1), the double bracket equations (^8.1.2), the A^-dimensional free 
rigid body ( ^8.3.1 ), the Euler equations for an inviscid ideal fluid both incompressible and 
compressible (^8.3.2), and the iV-dimensional Camassa-Holm equation (^8.41). For the one- 



dimensional Camassa-Holm equation the optimal reduction process produced its integrable 
Hamiltonian extension, the two-component Camassa-Holm equations in (8.48) and (8.49). 

We plan to continue the investigation of the relationships among problems in imaging, 
optimal control, and geometric mechanics. In particular, we plan to continue developing the 
dynamical systems framework for designing and interpreting methods of large deformation 
matching for image registration in computational anatomy. 
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